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Consequences of Commitment to and Disengagement 
from Incentives 


Vor. 82, No. 1 


Eric Klinger 
University of Minnesota, Morris 


8 Behavior and experience are organized around the enjoyment and pursuit 
of incentives. During the time that an incentive is behaviorally salient, 
an organism is especially responsive to incentive-related cues. This sus- 
tained sensitivity requires postulating a continuing state (denoted by a con- 
struct, current concern) with a definite onset (commitment) and offset 
(consummation or disengagement), Disengagement follows frustration, 
accompanies the behavioral process of extinction, and involves an incentive- 
disengagement cycle of invigoration, aggression, depression, and recovery. 
Depression is thus a normal part of disengagement that may be either 
adaptive or maladaptive for the individual but is probably adaptive for the 
species. The theory offers implications for motivation; etiology, sympto- 
matology, and treatment of depression; drug use; and other social problem 


areas, 


This article proposes a theoretical frame- 
work for accommodating psychological as- 
pects of incentive relationships. The theory 
is primarily integrative rather than reduc- 
tive, describing consistent relationships 
among various coordinate classes of obser- 


` vations and concepts rather than proposing 


specific mechanisms for them at another 
level of analysis. Nevertheless, the theory 
does suggest something about the properties 
of underlying behavioral and neurohumoral 
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structures, provides analyses of several psy- 
chological problem areas, and generates spe- 
cific predictions concerning motivational, 
emotional, and cognitive functioning. 


CENTRAL STATUS OF INCENTIVE IN 
MOTIVATIONAL THEORY 


The theoretical development takes as its 
point of departure an organism's relation- 
ship to its incentives, defined here conven- 
tionally as objects or events that attract an 
organism or repel it. "The concept of in- 
centive has steadily gained prominence in 
motivational theory for the past three dec- 
ades at the expense of the two other con- 
cepts that have dominated the field during 
this time: drive to describe internal motiva- 
tional states and reinforcement (in the op- 
erant sense) to describe an organism's com- 
merce with its environment. Both of these 
concepts have encountered serious theoreti- 
cal difficulties, m. 

The problems of drive stem principally 
from the growing evidence that whereas 
drive states as conventionally defined are 
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neither necessary nor sufficient to launch 
sequences of goal-directed behavior, expec- 
tations that the behavior will lead to an 
incentive are both necessary and sufficient 
(Bindra, 1968, 1974; Bolles, 1967 ; Klinger, 
1971). Dissatisfaction with the drive con- 
cept is thus widening (see also Appley, 
1970; Atkinson, 1964; Cofer & Appley, 
1964). Although drive states affect activ- 
ity levels (Bindra, 1968), determine the at- 
tractiveness of at least certain kinds of in- 
centives (such as food) at the point of con- 
summation, and generate internal stimuli, by 
themselves they do not instigate instrumen- 
tal behavior or the content of fantasy and 
dreams, Vigorous behavior can begin in the 
virtual absence of detectable drive, and 
fairly stable behavioral Sequences can con- 
tinue despite drastic fluctuations in the de- 
gree and configuration of drive states. It is 
therefore necessary to find a concept other 
than drive to tie together the sometimes 
elaborate long-term sequences of goal- 
directed behavior. 
The concept of operant reinforcement has 
fallen on potentially even harsher times. It 
has been cogently argued that the concept 
is theoretically either unnecessary or erro- 
neous (Atkinson & Wickens, 1971; Bolles, 
1972; Estes, 1969, 1971; Tapp, 1969; 
Walker, 1969). While nobody denies that 
the operation of reinforcement (that is, dis- 
pensing rewards) often works to modify be- 
havior, the occasions on which it fails and, 
perhaps more important, the manner in 
which it often succeeds disconfirm the theo 
retical process of reinforcement, 
there is still recent support for t 
that reinforcement incrementall 
stimulus-response (S-R) bonds (eg. Ber- 
lyne, 1967; Perkins, 1968) and hence pro- 
duces S-R learning, there is increasing evi- 
dence that reinforcement does not affect 
S-R bonds at all (Atkinson & Wickens, 
1971; Estes, 1971; Levine, 1971: 
1971; Nuttin & Greenwald, 1968 
certain other processes that ultimately affect 
performance. Rewards and punishments 
provide feedback concerning the effective- 
ness of the organism's behavior, attract at- 
tention, determine what gets stored in short- 
term memory or transferred to long-term 


Although 
he notion 
y strengthens 


Logan, 
) but only 


memory, and determine where the organism 
will allocate its resources of time and effort. 
Since the effects of rewards and punish- 
ments can thus be attributed to a number 
of processes and events that have indepen- 
dent theoretical stature, these effects need 
not be attributed to any separate process 
called "reinforcement." 

By vesting in incentive-related processes 
certain crucial functions that have previ- 
ously been attributed to drive, and by limit- 
ing the status of reinforcement to operations 
that manipulate incentives, the argument 
places a heavy explanatory burden on the 
concept of incentive and hence raises press- 
ing questions about how to conceptualize 
the means by which incentives can exert 
such far-reaching effects. This article pre- 
sents an approach toward such a theory of 
incentive-related mechanisms by examining 
and conceptualizing the consequences to an 


organism of its organization around in- 
centives. 


a 
CONSEQUENCES OF COMMITMENTS 
TO INCENTIVES 


Two Constructs: Current Concern 
and Commitinent 


In relatively. simple species, pursuing a 
goal can more accurately be described as a 
two-fold process of, first, engaging in ac- 
tions that increase the likelihood of encoun- 
tering the goal object and, second, being 
biochemically and behaviorally receptive to 
consummating the goal object. Higher spe- 
cies, of course, retain those characteristics, 
but their goal striving depends less on im- 
mediate cues and is therefore more persist- 
ent. A mosquito wafted out of Sensory 
contact with a warm body is unlikely to 
retrace its steps in order to take up where 
it left off, but a dog that has lost sight of 
a thrown stick may search for a while, and 
a person trying to create an experimental 
result may go on doggedly for years search- 
ing for the right combination of conditions. 

An organism that can keep on pursuing 
a goal despite many changes in drive states 
and environmental cues, even in the face of 
repeated obstacles, must possess a continu- 
‘ng neural process that somehow maintains 
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the goal as a controlling influence on be- 
havior. In the terminology of this article, 
the hypothetical event that marks the onset 
of this process is called the organism's be- 
coming "committed" ! to that particular goal. 
The "state" of having become committed to 
a particular goal is here called a "current 
concern" about achieving the goal. By this 
definition, an organism may be in the throes 
of several or even numerous current con- 
cerns (and hence numerous states) at a 
time. For instance, a student may be com- 
mitted to pursuing a degree, completing cer- 
tain courses, maintaining or improving his 
relationship with a close friend, finding a 
part-time job, dinner, skiing at year's end, 
and so on. Each of the ends to which he is 
committed constitutes an incentive (because 
he is attracted to it) and a goal (because he 
is committed to striving for it), and about 
each he has a current concern (the construct 
that represents his state of being committed 
to that goal). 
The concept of current concern is, at this 
time in our theory building, a provisional, 
‘heuristic, nonmentalistic construct devised 
to label a discrete state: the state of the or- 
ganism between the time of commitment and 
the time of consummation or disengagement. 
The influence that states of concern exert 
on behavior depends on a number of varia- 
bles that can be investigated empirically, but 
these are not part of the concern construct 
itself. Hence, current concern is unlike 
Atkinson and Birch’s (1970) “action tend- 
ency.” After commitment its existence 
does not depend on arousal by cues, and it 
has no dimension of “strength” that depends 
on situational factors, although the behavior 


1 The construct “commitment” is formally some- 
what similar to Lewin’s (1928) concept of Vor- 
nahme. It may also seem to resemble Irwin's 
(1971) concept of “intention.” These terms, 
however, imply a property of being conscious 
which I do not wish to stipulate. Furthermore, 
Irwin defines intention as involving a three-fold 
correspondence among a subject’s preference for 
a specified outcome, an expectancy that a certain 

“act will lead to that outcome, and a second ex- 
pectancy that an alternative act would lead to a 
nonpreferred outcome, This definition lacks the 
temporal specificity of the commitment construct 
as the onset of a state. It seems best to distin- 
Buish the two concepts by using different terms. 


that arises during the state is, of course, af- 
fected by an array of moderator variables. 
"Thus, current concern is theoretically com- 
patible with such other concepts as action 
tendency, but it is formally different and it 
performs different functions theoretically 
and heuristically. 

It may be helpful in clarifying the con- 
cept of current concern to contrast it with 
some other concepts. It resembles Lewin's 
(1928) concept of "quasineed," but it does 
not share the distinction between quasi- 
needs and "genuine needs." It implies nar- 
rower classes of satisfiers than “need” 
(Murray, 1938) or “motive” (McClelland, 
Atkinson, Clark, & Lowell, 1953; e.g., com- 
mitment to setting a new sales record rather 
than to achievement). It differs from 
“drive” and older uses of "need" in elimi- 
nating connotations of both tissue deficits 
and of activity in neural circuits commonly 
identified with consummatory behavior or 
reward sites. Finally, it differs from “cen- 
tral motive state” (Bindra, 1974; Morgan, 
1943) in referring in each usage to a poten- 
tially enduring state rather than to one elic- 
ited by situational cues. Current concerns 
are conceived here to persist even during 
intervals when the organism is engaged in 
behavior irrelevant to the concern, although 
they continue to influence behavior in more 
subtle ways. 

Most previous research approaches have 
proceeded without a concept of commitment 
as an onset event warranting investigation 
in its own right, and apparently none has 
provided either evidence concerning the hy- 
pothetical process of commitment as such, 
apart from its sequelae, or independent mea- 
sures of it. At this point, therefore, the ex- 
istence of a commitment process must be 
inferred? as a logically necessary antecedent 


? Because there are at present no independent 
measures of commitment, operational definitions 
that an organism has become committed to some- 
thing must take one of two forms. First, one can 
observe some of the response classes that presuma- 
bly follow commitment such as detour behavior, 
preoccupation with an incentive, etc. This is not 
necessarily a circular procedure, since under the 
present theory knowing some of these behaviors 
permits prediction of others. A second porn of 
operational definition is to arrange conditions that 
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target word over a relatively brief time 
span, conforms with operations designed to 
produce effects of “instructional sets.” How- 
ever, existing evidence suggests that people 
can remain sensitized beyond the range of 
such a time frame. 


For instance, both waking and sleeping 
Subjects are continuously sensitive to their 


QN Wa, Ly, another dichotic listening 
experiment, subjects tended to notice in- 
structions on the nonattended channel when 
these were preceded by the subject’s name 
(Moray, 1959). This result can be taken 
to indicate shifts of attention to concern- 
related cues if one assumes that subjects 
have learned that mention of their names is 
usually relevant to one or another of their 
current concerns. These results cannot be 
explained as effects of instructional sets and 
could not have involved information in 
short-term storage. 

Tt could be argued that one's own name 
is so powerfully conditioned to arouse emo- 
tions that it is too nonspecific a stimulus to 
use for assessing the accompaniments of 
Current concerns, but other evidence from 
sleeping subjects suggests that the Moray 
results need not be explained solely in terms 
of conditioned emotional arousal. Subjects 
who were put to sleep in a laboratory with 
Instructions to clench their fists whenever 
they heard an arbitrarily selec 
their own) manage 
on one eighth of th 
very rarely clenche 
to 54 control n 
Treisman, 1960). 

Jne important issue r 


Concerns approach to d 
whether 


(or disengagement) or 
come responsive only 3 
NOUS, on, a particular 
it seems likely that 
emotions or expectan 
centive will incre 
cues related to it 


arousing a subject’s 
cies regarding an in- 
ase responsivity toward 
(Klinger, 1973), the eyi- 
dence suggests that not all of the increased 
responsivity to concern-related cues can be 
explained in this way. Thus, for instance, 
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subjects’ tendency in the experiment by Os- & 
wald et al, (1960) to clench their fists when 


they heard assigned names other than their 
own would have required arousal over such 
a long time period and under such unfavora- 
ble conditions (sleep) as to confirm the 
need for the kind of construct represented 
by the concept of current concern, Other 
kinds of behavior commonly observed out- 


side the laboratory, such as a sleeping moth- 
er’s specific sensitivity to her young xS 
noises or a sleeper's ability to wake 1970) 
approximately a preset time (Tart; "ea 
also suggest the need to postulate a dus à 
ing neural process that is not dependent m 
the kind of short-term arousal produced by 
situational cues or emotional responses. 
Some unpublished data that bear on these 
questions are beginning to emerge with the 
use of thought-sampling methods for the 
study of mental content. Subjects listened 
dichotically to tape-recorded narrations of 
fiction, segments of which had been re- 
written to allude unobtrusively to individ- 
ual subjects’ current Concerns or to mat- 
ters believed not to be concerns, Subjects 
spent more time attending to and were more 
likely to recall concern-related passages than 
nonconcern-related Passages. Second, the 
ts that they reported on these 
re likely to be themati- 
e preceding passages if 
een recalled than if they 
Passages Were concern 
bjects had not received 
to sensitize them to 
had been inferred 
€stionnaires admin- 
ays, their differential 


-related stimuli cannot : 
Srounds of 


be explai, 


hed on 
arousal, 


situational ` 


e preattentive 
: 1967) that control shifts 
attuned to material related 
Tent concerns, This, then, 
Serve as the mechanism that accounts 
he Preponderance of concern-related 


nt in fantasy and dreams. 


in attenti 
to 


on are 
à person's cur 
may 
for t 
conter 


i Effects on Perception and/or Recall 


There are by now many experiments on 
the effects of motivation and instructions on 
perception or perception-related responses 
(e.g., Haber, 1966; Jenkin, 1957; Postman, 
1953). Although this research has demon- 
strated a variety of means for reliably in- 
fluencing perception-related responses, it has 
so far not resolved the question of precisely 
what is affected: the perceptual experience 
itself, response biases, or the way in which 
the subject codes and organizes his memory 
of the percept (Haber, 1966). Information- 
processing analyses suggest a number of 
mutually compatible loci of influence (Er- 
delyi, 1974): search strategies, receptor 
modifications, encoding strategies, decisions 
regarding transfer to short- or long-term 
memory, and selection of overt responses. 
Most of these components are overt or co- 
vert instrumental acts and as such are 
clearly under the control of incentive ex- 
pectancies, "There remains the possibility, 
of course, that perception and recall are also 
enhanced motivationally in other ways. 

The existing evidence is insufficient to 
pinpoint the specific processing steps in per- 
ceptual reporting that are affected by moti- 
vational variables. Nevertheless, it can at 
least be concluded provisionally that stimuli 
related to a subject's current concerns seem 
more prominent than other stimuli (Jenkin, 
1957). Thus, for instance, subjects report 
need-related objects to look larger (Bruner 
& Goodman, 1947; Lambert, Solomon, & 
Watson, 1949) and brighter (Gilchrist & 
Nesberg, 1952) than they are, and attrac- 
tive or unattractive male faces seem more 
familiar than neutral faces (Gerard, Green, 
Hoyt, & Conolley, 1972). Subjects are also 
likely to interpret ambiguous or very briefly 
exposed stimuli as representing objects that 
are related to current needs (Atkinson & 
McClelland, 1948; McClelland & Atkinson, 
1948). 

Results such as these were once inter- 
preted as effects of drive. However, they 
can probably be better interpreted to reflect 
nondrive factors such as incentive expec- 
tancies, Saugstad (1966), for instance, con- 
cluded from a review of hunger effects on 
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perceptual and cognitive processes that "the 
important condition may not be hours of 
deprivation, but the expectancy of the Ss as 
to when they may next receive food" (p. 80). 


Integrative Discussion 


Up to this point, the theory as described 
accomplishes basically three kinds of things. 

l. It establishes a set of concepts that 
refer to a hypothetical state (current con- 
cern) with a starting point (commitment) 
and end point (consummation or disen- 
gagement). 

2. The concept of current concern con- 
stitutes a conceptual focus for the inter- 
relatedness of specified response classes (ac- 
tions, mental content, preattentive processes, 
and perception) that might otherwise seem 
quite diverse, and its use argues that with 
respect to such response classes the orga- 
nism is organized around its current con- 
cerns, 

3. The theory pinpoints important mo- 
tivational issues and facilitates analysis of 
a number of important problem areas. Each 
new concept raises fundamental questions 
concerning its parameters and the charac- 
teristics of the underlying process. (a) The 
underlying process presumably persists from 
commitment forward. Terminating it short 
of goal attainment requires an active coun- 
tervailing process. It incorporates semantic 
codes for specific incentive-related cues and 
cue classes and maintains these in long-term 
Storage. It contains comparator mecha- 
nisms for evaluating response feedback and 
determining goal attainment. (In the lat- 
ter respects, it operates similarly to a "plan" 
| Miller, Galanter, & Pribram, 1960].) (b) 
The finding that people are especially re- 
sponsive to concern-related cues (including, 
presumably, the concern-related cues emitted 
in the person's own previous thought) 
helps to explain the previously described 
effects of concern states on mentation 
(Klinger, 1971), since the increased respon- 
siveness to concern-related cues would itself 
increase responses to those cues, which are, 
of course, likely to be concern related. Ap- 
plying these concepts in this way generates 
further specific predictions about the effect 
of current concerns on the effectiveness of 
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creative thought and on the efficiency of 
problem-solving thought (fluency, distract- 
edness, effortfulness) in general. (c) The 
basic notion that organisms are behaviorally 
(including ideationally and affectively) or- 
ganized around incentives immediately raises 
the question as to what happens—to current 
concerns, to mental content, and to affect as 
well as to action—when a person finds him- 
self in a situation barren of incentives, (d) 
The theory raises empirical questions as to 
the variables that moderate the influence 
of concerns on action, attention, thought, 
and affect, Preliminary analyses suggest 
determining variables such as those already 
suggested by Expectancy x Value theories 
as well as others such as the amount of time 
before something must be done and previous 
investments of time and effort. (e) There 
are basic questions regarding the nature of 
the commitment process, such as whether it 
is in fact a discrete event or whether it con- 
stitutes a continuous variable. (f) A fur- 
ther important issue involves the degree to 
which a state of concern is specific with re- 
spect to the particular instrumental acts, 
particular time relationships, and other at- 
tributes of goal attainment, and how these 
specifications are incorporated, 


CONSEQUENCES or DISENGAGEMENT 
FROM INCENTIVES 


The conceptualization described thus far 
has dealt with the consequences of commit- 
ment as though the value of incentives re- 
mains constant, and it has said nothing 
concerning interferences with goal attain- 
ment. Tn fact, however, incentive values do 
fluctuate. They are most likely subject to 
habituation and to shifting adaptation levels, 
The incentive, once obtained, may not come 
up to expectation. Its value may vanish 
with a goal to which it was chained (Cos- 
tello, 1972). Unforeseen difficulties in pur- 
suing an incentive appear for 
crease its value. Depression d 
incentive values, 

Although it seems likely that all lapses 
from the expected accessibility, timeliness, 
and quality of an incentive affect its value, 
the discussion that follows will, for the sake 
of simplicity, focus on the consequences of 


a time to in- 
ecreases many 


complete failure, leading to the nullification 
of commitment. Given the powerful organi- 
zational properties possessed by states of 
current concern, it seemed likely that, when 
goals are unattainable or too costly and must 
therefore be abandoned without consumma- 
tion, ending the process set in motion by 
a commitment should itself be a rather in- 
volved process. Furthermore, deductions 
from the theory suggested that dissolving 
an important concern should tend to produce 
effects such as apathy, reduced instrumental 
striving, loss of concentration, and increased 
preoccupation with momentary cues, a pat- 
tern that is recognizably similar to de- 
pression. 

Fortunately, there is a large amount of 
evidence that can be brought to bear on this 
Process since blocked goal striving is the 
operational definition of frustration and, 
under many circumstances, of extinction. 
It leads to a variety of consequences besides 
a cessation of operant activity—invigoration, 
primitivization, aggression, and depression, 
The purpose of this section is to present an 
integrated description of these consequences, 
showing them to be not contradictory but 
rather different phases of an orderly process 
of ending commitment to an incentive, a 
Process of disengagement involving an in- 
centive-disengagement cycle. 

For purposes of exposition, the phenom- 
ena of the incentive-disengagement cycle 
are described in five categories or phases. 
Nevertheless, the cycle is intended to be de- 
picted as continuous rather than as a se- 
quence of discrete stages. Furthermore, 
there are circumstances in which the invig- 
oration or aggression phases may be attenu- 
ated or absent during disengagement, Fi- 
nally, a person may be in various phases 
of several different cycles simultaneously. 
each involving a different incentive, with the 
contemporaneous phases of the several cycles 
summating or interacting, 


Invigoration 


After an organism has launched an in- 
trumental sequence, it responds to an ob- 
stacle with increased Vigor, taking the form 
of more powerful Motions, more rapid re- 
Sponses, more intense concentration, and so 


' tration 
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forth. A form of the invigoration effect is 
well known under Amsel's label of "Frus- 
tration Effect" (Amsel, 1958, 1962).* That 
is, animals run faster in a second runway 
after nonreward than after reward in a 
first runway. When disappointed animals 
are put into an open field right after non- 
reward instead of into a second runway, 
their response vigor also increases (Gallup 
& Altomari, 1969). Retardates rock more 
after nonreward (Forehand & Baumeister, 
1971), and children respond more force- 
fully (Gilbert, 1969; Holton, 1961; Ryan & 
Watson, 1968), especially if they are stopped 
just short of (rather than far from) achiev- 
ing a goal (Holton, 1961). The frustra- 
tion effect is independent of whether Re- 
ward 1, which was omitted, resembles Re- 
ward 2 (Berger, 1969), the amount of time 
required to get the reward (Logan, 1968), 
or whether the animals responses are 
thwarted after nonreward (DiLollo, David- 
son, Hammond, & Donovan, 1968). Re- 


* Staddon (1970, 1972) has presented an incisive 
analysis of weaknesses in the experimental evi- 
dence for an energizing frustration effect. Staddon 
proposes an alternative explanation, that the frus- 

effect reflects not enhanced responding 
nonreward but inhibited responding aíter 
reward. However, his analysis does not seem to 
invalidate the argument presented here. Using an 
operant (key-pecking) design, he and some col- 
leagues have: shown that omitting reward for key- 
pecks at the end of fixed intervals increases re- 
sponse rate (that is, eliminates "scalloping") dur- 
ing the following intervals. However, when re- 
ward is contingent on not pecking (so as to pro- 
duce an intrainterval negative goal gradient), 
omitting reward reduces pecking (that is, in- 
creases not pecking) on the following intervals. 
This would seem consistent with the notion ad- 
vanced here that the first effect of frustration is 
to invigorate operant acts that have been learned 
to obtain reward. Staddon denies this explanation 
on grounds that when nonreward is accompanied 
by some of the cues (in his case, blackouts) that 
had accompanied reward, the following operant be- 
havior is enhanced less than after nonreward un- 
accompanied by those reward-related cues. How- 
ever, it can be assumed that reward-related cues 
would have acquired secondary incentive value, and 
would therefore (at least initially) dampen the 
extent of affective frustration, Results described 
below (Klinger, Barta, & Kemble, 1974) sup- 
port a response invigoration interpretation. and 
seem unaccountable in terms of Staddon's reward- 
dependent inhibition explanation. 


after 


tardates and children react more vigorously 
to frustration in accordance with the num- 
ber of times the response had been rewarded 
prior to nonreward (Forehand & Baumeis- 
ter, 1971; Holton, 1961). Mandler (1964) 
found that during extinction of an over- 
learned response, rats “exhibit both ex- 
ploratory behavior and repeated at- 
tempts to perform the interrupted sequence" 
(p. 196). People express more need for 
achievement in Thematic Apperception Test 
stories written right after failing at a task 
than after succeeding (McClelland et al., 
1953). Although it is still possible that in- 
vigoration effects would disappear if the 
organism could be certain that its chances 
of goal attainment had become nil, an in- 
vigoration effect clearly has been demon- 
strated in widely varying investigations. 

Frustration is reflected not only in be- 
havioral invigoration but also in the value 
of the blocked incentive. When people are 
frustrated they come to regard the incentive 
as more attractive than before, judging from 
ratings (Brehm, 1972; Mischel & Masters, 
1966) and from looking time, overestima- 
tion of size, and favorable statements about 
syllables that were associated with it (Knott, 
Nunnally, & Duchnowski, 1967). There is 
seemingly no evidence concerning what hap- 
pens to the value of other incentives while 
the value of the blocked incentive rises. 
However, adaptation level theory (Appley, 
1971) and incentive contrast phenomena 
(Adamson, 1971; Fowler, 1971) suggest that 
if one value rises sharply the other values 
should weaken. It therefore seems likely 
that during the invigoration and aggression 
phases not only does the blocked incentive 
become more attractive but also others be- 
come less so: the loss of “perspective” often 
noted in people who become over-involved 
in frustrated undertakings. 
Aggression 

There is little need to review here the 
extensive literature on the frustration-ag- 
gression effect (Dembo, 1931; Dollard et 
al., 1939). With appropriate qualifications 
(Berkowitz, 1962), frustration commonly 
produces aggression, „even though other 
events may also sometimes produce aggres- 
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sion and even though frustration need not 
always do so. With continued frustration 
responses become more stereotyped and 
more primitive until at last they often may 
be recognized as destructively aggressive. 
After frustration, children's play responses 
become more primitive (Barker, Dembo, & 
Lewin, 1941) as do adults responses in 
tests of originality and divergent thinking 
(Hinton, 1968). Extinction in pigeons 
(Azrin, Hutchinson, & Hake, 1966) and in 
people (Kelly & Hake, 1970) and non- 
reward in rats (Gallup, 1965) have specifi- 
cally been shown to produce eventual ag- 
gression. Even when Subjects report no 
anger following failure, their blood pressure 
rises (Gentry, 1970). When a personal 
catastrophe sends someone into a reactive 
depression, it is very likely to send him first 
into disbelief and angry protest, as in blind- 
ness (Fitzgerald, 1970) and separation of 
human children (Bowlby, 1973) and infant 
monkeys (McKinney, Suomi, & Harlow, 
1971) from their parents, 


Downswing into Depression 


At some point during sustained unre- 
lieved frustration, organisms begin to give 
up. The available evidence suggests that 
failure, loss, and disappointment (in the 
objective sense) generally lead to depres- 
Sion-like responses that range along a con- 
tinuum from mild disappointment (in the 
Subjective sense) to severe depression, 

The downswing into depression is prob- 
ably not a matter of the person 
his anger "inward." In one study of this 
issue, clinically depressed patients expressed 
more resentment than nonpatients, and both 
their resentment and their verbal hostility 
decreased as the depression lifted (Fried- 
man, 1970). When not depressed, the de- 
pression-prone members of t 
sponded with less verbal hostility than the 
matched controls. College student Subjects 
who provided daily self-ratings of mood 
yielded positive correlations between de- 
pression and anger both within subjects 
over time and across subjects (Wessman & 
Ricks, 1966). Thus, it seems clear that 


depression is accompanied by heightened 
hostility. 


“turning” 


his sample re- 


There are several lines of evidence to sup- 
port the notion of a frustration-depression 
sequence, After subjects are experimentally 
induced to fail, they describe themselves as 
more depressed (Ellner, 1970; Lish, 1969), 
and they express less self-esteem than after 
successful completion of a task (Lish, 1969). 
If failure is accompanied by much frustra- 
tion, the Thematic Apperception Test sto- 
ries subjects write immediately thereafter 
express reduced rather than heightened 
needs to achieve (Birney, Burdick, & Tee- 
van, 1969). 

Many kinds of personal catastrophes send 
people into periods of grief or depressed 
despair following the initial surge of pro- 
test: losses of personal relationships, roles, 
reputations, or "significant objects" (Aver- 
ill. 1968), becoming blind (Fitzgerald, 
1970) or suffering other disabilities that in- 
terfere with their pursuit or enjoyment of 
valued incentives (Orbach & Sutherland, 
1954; Talbott, 1970), and separation from 
parents (Bowlby, 1960; McKinney & al, 
1971). People are more likely to be hos- 
pitalized for depression when their com- 
munities offer few economic opportunities 
(Linsky, 1969). Depressed patients ex- 
perience a much higher proportion of up- 
setting events within a short time before 
their admission than a matched group of 
nonclinical subjects 5 (Paykel, Myers, Die- 
nelt, Klerman, Lindenthal, & Pepper, 1969), 
and one can almost always find precipitating 
events in the weeks before the onset of clini- 
cal depression, usually multiple events 
clustered over a relatively short time period 
(Leff, Roatch, & Bunney, 1970; Grinker, 
Miller, Sabshin, Nunn, & Nunnally, 1961). 

Precipitating events virtually 
terfere with the patient's 
for important positive inc 
his ability to avoid threatening consequences. 
The most common Precipitating events re- 
ported in the studies just mentioned are 


changes in sexual and marital relationships, 
ee 
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moves, job changes, losses of loved ones, 
*iflness, and the like. When Izard (1972) 
asked 332 college students to describe a 
situation in which they were depressed, his 
subjects most often mentioned academic 
failure or the prospect thereof (22%), dis- 
rupted heterosexual relationships (21%), 
loneliness (15%), “loss or failure in non- 
academic competition" (696), “death or 
illness of loved one or friend" (5%), and 
“difficulties with parents" (3%). The re- 
maining 2896 reported situations that com- 
bined several of these categories (1096) or 
reported miscellaneous other, often rather 
similar situations (1896). 


Depression? 

Whether a person's depression is mild, 
agitated, or retarded, he reduces or stops his 
instrumental activity while feeling helpless 
and hopeless (Bibring, 1953; Grinker et al., 
1961; Wessman & Ricks, 1966; and, in- 
deed, most clinical commentators on de- 
pression), Apathy toward normal incen- 
tives is one of the most characteristic ac- 
companients of depression (Beck, 1967) 
and grief (Marris, 1958). Depressed pa- 
tients direct and receive fewer social acts 
and time them less skillfully (Libet & 
Lewinsohn, 1973; Lewinsohn, 1974). It is 
unclear from these data which variable, so- 
cial skill or depression, is causal, but since 
normal subjects are less attracted to an 
“anonymous stranger” after having been 


®The foregoing and following passages clearly 
tread on important diagnostic issues concerning the 
existence of true endogenous clinical depression 
(that is, without external precipitators) and con- 
cerning whether endogenous and reactive depres- 
sion constitute one, two, or more dimensions, How- 
ever, these issues are complex, the evidence bear- 
ing on them is inconclusive, and the issue of 
dimensionality is not central to the argument pre- 
sented in this article. Discussion of them is there- 
fore omitted. The argument proceeds with de- 
pression defined as an affective state or variable 
rather than as a diagnostic entity (although it 
might be inferred from a diagnostic classification). 
Since depression is presumably mediated by a 
complex system of neural and humoral compo- 
nents, and since a variety of organic factors can 
disorder this system in the absence of precipitating 
factors, the present formulation does not exclude 
possible etiologies of depression other than that 


under consideration here. 


shown a depressing film (Gouaux, 1971) 
and are less attracted to excitement during 
depressed moods (Ludwig, 1970), and in 
view of the general drop in incentive values 
commonly observed in clinical depression 
(Beck, 1967), it seems likely that depres- 
sion impairs social functioning by attenuat- 
ing its motivational base. 


Recovery 


There is apparently not much evidence as 
to what terminates depressions other than 
antidepressant medications and electrocon- 
vulsive shock. Recovery usually does occur, 
especially from reactive depressions, even 
without specific therapy, but the natural 
precipitators of recovery are still unknown. 
At some point in the cycle, successes of some 
kind most likely play a role. Despite the 
fact that apathy (including loss of appetite 
for food and sex) is one of the pervasive 
clinical characteristics of depression, in at 
least one study depressed patients were 
found unusually responsive to success in 
minor tasks (Beck, 1967; Loeb, Beck, & 
Diggory, 1971), and therapy based on im- 
proving patients’ social skills is claimed to 
have a positive effect (Lewinsohn, 1974). 
There is no hard evidence with which to 
determine whether people are especially re- 
sponsive to successes throughout their de- 
pressions or whether they become increas- 
ingly responsive as their depression takes its 
course, a view favored by Jacobson (1971). 

It would be interesting to know more 
about what happens to the relative value of 
the lost incentive during the depression 
phase. It seems clear enough that the value 
of other incentives remains low, as though 
there were a carry-over or perhaps even a 
deepening of their devaluation from the in- 
vigoration and aggression phases. How- 
ever, there is clinical evidence to suppose 
that the value of the lost incentive remains 
high at least during the early part of be- 
reavement, Parkes (1972) points out the 
continued searchlike behavior and affect of 
the bereaved. It may be that depression 1n- 
volves a gradual decline in the affective 
value of the lost incentive and a gradual 
shift in adaptation level to permit a recovery 


of other values. 
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AN INCENTIVE-DISENGAGEMENT CYCLE 


The evidence presented in the previous 
Section demonstrates that loss of incentives 
leads to several specifiable effects: invigora- 
tion, primitivization of response, aggression, 
and depression, from which there is usually 
an eventual recovery. However, the data 
concerning the various effects come from a 
variety of sources: primarily experimental 
evidence for invigoration and aggression and 
primarily clinical evidence for depression. 
What evidence is there that these various 
effects of incentive loss constitute a co- 
herent, orderly cycle? 


Evidence for a Cycle 


A few of the naturalistic and clinical stud- 
ies already cited, which followed subjects 
longitudinally or tried to reconstruct the 
sequence of events retrospectively, strongly 
suggest the cyclical nature of the effects. 
Work with separation of mothers from in- 
fant monkeys (McKinney et al., 1971) and 
from human infants (Bowlby, 1973), and 
studies of bereavement (Marris, 1958; 
Parkes, 1972) and of blindness (Fitzgerald, 
1970) all report behavior that seems to cor- 
respond to an initial stage of invigoration 
(activity or disblief) followed by a stage of 
aggression (protest or blame) which shades 
into a period of depression (grief or ap- 
athy). Eventually, if the person’s life situ- 
ation offers some alternative incentives that 
seem accessible, the depression lifts and the 
person readjusts, The similarity of these 
Sequences in samples of quite different ages, 
in different life situations, and even in dif- 
ferent primate Species suggests that the se- 
quence is rooted in the Structure of mam. 
malian (or at least primate) motivation, 

The regularity of the findings su 


1 n ggested 
that it would be worthwhile to look for- such 
a sequence in another species with another 


kind of incentive (Klinger, Barta, & Kem- 
ble, 1974), Building on a finding that 
open-field activity of rats is higher just after 
unrewarded runway trials than after re. 
warded ones (Gallup & Altomari, 1969) , 
the procedure measured open-field activity 
of hooded rats once after a runway acquisi- 
tion trial and following four evenly spaced 


runway trials during extinction, Four 
groups of rats varied as to which particular 
set of runway extinction trials were followed 
by open-field trials so that every runway ex- 
tinction trial was followed by an open-field 
trial with one of the four groups. A fifth 
(control) group remained on a continuous 
reward schedule throughout. The critical 
tests of significance were the cubic com- 
ponent of the open-field activity trend for 
the four experimental groups (p< .001), 
the cubic component of the Groups Trials 
interaction comparing the four combined ex- 
perimental groups with the control group 
(P <.025), and the difference of the ex- 
perimental groups’ fourth (depressed) ac- 
tivity trial means from their acquisition and 
recovery means and from the controls’ mean 
(p <.05 in each case). Clearly, all phases 
of the cycle except aggression are reflected 
in these data (see Figure 1j. 

The results support the theory that the 
incentive-disengagement Sequence consti- 
tutes an orderly cycle. Thus, the phe- 
nomena of extinction, frustration, invigora- 
tion, depression, grief, and at least certain 
forms of aggression are brought together 
under a single theoretical roof within the 
larger framework of a particular incentive 
theory of motivation. 

Further. considerations. The fact that a 
certain sequence of events unfolds lawfully 
within certain conditions, while theoreti- 
cally important, does not by itself demon- 
Strate that the phases of the sequence are 
governed by a single cohesive set of mecha- 
nisms. Existing evidence is insufficient to 
establish the unitariness or interrelatedness 
of the mechanisms that might subserve such 
a sequence, There are, however, some in- 
dications that implicate norepinephrine and 
the amygdala. The evidence concerning the 
role of norepinephrine and the cycle is still 
largely | speculative: Whereas aggressive 
States are associated With increased norepi- 
nephrine (Kety, 1970), there are indica- 
tions that depression is associated with nor- 
epinephrine depletion (Schildkraut, 1965)- 
i The evidence implicating the amygdala E 
incentive-related (and cycle-related ) dut. 
tions is considerably stronger. Ablation o 
the amygdaloid complex : 


INCENTIVE RELATIONSHIPS 13 


ACQUISITION | EXTINCTION 


@ CONTROL (c) 
O GROUP 1 (61) 
© GROUP 2 (62) 
@ GROUP 3 (63) 
Q GROUP 4 (64) 


MEAN ACTIVITY LEVEL 


w 
E 
I 


12 24 36 48 1 2 


a 
= 
= 
> 
me 
=] 
> 
= 
> 
= 
[9] 
= 
= 
< 
mi 
= 


8 


TRIALS 


7 8. 9* 10 1j 12, 43 "12" 46 1745 


TRIALS 


Freure 1. Activity levels (number of 10-cm squares entered per 60 sec) as a function of 


preceding runway trials. 


groups whose separate curves are presented in the upper right. 


(The large curve is a composite of data from four experimental 


The unconnected black dots 


represent the activity levels of Control Group 5, also presented in the extreme upper right. 


The second point from the left in the large curve represents identical acti 
From "Cyclic Activity Changes During Extinction in Rats: A 


2 and Group 5 (control). 


ity means for Group 


Potential Model of Depression,” by E. Klinger, E. D. Kemble, and S. G. Barta, Animal Learn- 
ing and Behavior, 1974, 2, 313-316. Copyright 1974 by the Psychonomic Society. Reprinted 


by permission.) 


produces profound "taming" of wild animals (e.g., 
Schreiner & Kling, 1953, 1956), deficiencies in a 
variety of aversively motivated learning tasks 
(e.g, Ursin, 1965, Horvath, 1963), and reduced 
responsivity to various positive (e.g, Kemble & 
Schwartzbaum, 1969, Kemble & Beckman, 1970) 
and negative (Kemble & Beckman, 1969) incen- 
tives. Finally electrical stimulation within the 
amygdala results in well-defined emotional be- 
havior such as attack or fleeing (eg, Ursin & 
Kaada, 1960). (Kemble, 1973) 7 


Mark and Ervin (1970) have reported in- 
stances in which seizure-produced rages 


gave way to episodes of depression. Other 
clinical evidence implicates the amygdala in 
many such seizures. Finally, amygdalec- 
tomy in rats abolishes the Amsel frustration 
effect (Henke & Maxwell, 1973) and all 
phases of the incentive-disengagement cycle 


* Unpublished communication (November 23, 
1973) from Ernest D. Kemble, to whom I am 
deeply indebted for continually stimulating discus- 
sions on the neuropsychological aspects of incen- 


tive-related behavior. 
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as measured by open-field activity levels fol- 
lowing runway extinction trials (Barta, 
Kemble, & Klinger, in press). The amyg- 
dala, of course, is a complex entity, and it 
participates in systems that extend anatomi- 
cally well beyond it. Whatever its role may 
turn out to be, a distinguishing feature of 
the present theoretical approach is to empha- 
size the specific interconnectedness of the 
processes involved in commitment to and 
disengagement from incentives, and hence to 


prescribe a search for interconnected mecha- 
nisms. 


Duration and Intensity 


Incentive-disengagement cycles would 
seem to vary greatly in duration and in- 
tensity, ranging from minor disappointments 
whose effects dissipate within hours to major 
depressions that last for months. One might 
well wonder whether such quantitatively 
different phenomena should be regarded as 
manifestations of similar processes, but the 
similarity in their antecedents and course 
suggests that they should. There is, un- 
fortunately, very little evidence concerning 
what determines the length and intensity of 
such cycles. Presumably, duration and in- 
tensity depend on individual differences and 
on the nature and timing of the precipitating 
loss. General observation suggests that 
greater invigoration, aggression, and depres- 
sion follow the loss of more important in- 
centives. All phases also presumably de- 
pend on cognitive factors such as the per- 
son's belief in the reality and finality of a 
loss, his estimate of how costly and effective 
aggression might be, and his assessment of 
the substitutability of alternative incentives. 
For instance, infants seem more distressed 
at separation from someone to whom they 
have formed a strong attachment, and their 
distress is less severe if they are quickly 
transferred to an effective mother surrogate 
(Bowlby, 1973). 

Severity of a loss probably depends on the 
extent to which the lost incentive was capa- 
ble of eliciting positive affect and on the 
proportion of the organism's response reper- 
tory that was shaped specifically to obtain 
and enjoy the incentive (Lewis, 1961; 
Parkes, 1972). "Thus, a close personal re- 


lationship or a gratifying occupation would 
be among the most distressing kinds of in- 
centives to lose. The duration of the depres- 
sion phase of the cycle may depend addi- 
tionally on the number and prominence of 
the cues remaining in the organism's envi- 
ronment that were associated with the in- 
centive. Such cues reduce the frustration 
effect (Staddon, 1970, 1972) and lengthen 
extinction (e.g., Melching, 1954). In be- 
reavement, such reminders are often espe- 
cially painful (Marris, 1958). Perhaps. 
then, they generally retard the progress of 
disengagement. Despite these indications, 
there is clearly a need for advances in both 
theory and research to establish the param- 
eters of the cycle. 


Adaptive Functions 


Adaptive junctions for individual survival. 
The most general characteristics of the in- 
centive-disengagement cycle seem deducible 
from a priori considerations of an organism's 
requirements for survival. First, an orga- 
nism would undoubtedly benefit from a 
tendency to pursue vital incentives tena- 
ciously, A number of attributes serve this 
function: The pleasant states that accom- 
pany “primary reinforcement" appear to 
form a part of a system for establishing the 
psychological value of certain essential in- 
centives; an organism's responsivity toward 
concern-related stimuli provides a basis for 
reminding it of its incentives; and, in gen- 
eral the mechanisms underlying persistent 
states of current concern keep the organism 
as much as possible on target toward its 
goals. Moreover, if one method for attain- 
ing an incentive is blocked, the organism 
would clearly benefit from a tendency to at- 
tempt other methods and, if its repertory of 
efforts fails, to try again with greater physi- 
cal and psychological force: the invigoration 
and aggression phases. On the other hand, 
Such an organism would totally exhaust it- 
self pursuing its first blocked goal if it di 
not incorporate a mechanism for annulling 
its commitment to an inaccessible incentive, 
thus ending that particular current concern. 
If, however, rest or the end of hard striving 
carries positive incentive value, it would Bs 
important that the disengagement process s 
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affectively aversive in order to prevent fail- 
ure from taking on a net positive incentive 
value, which would surely be a dangerous 
state of affairs. Depression, of course, em- 
bodies the properties of such a mechanism. 
Within this framework, then, some degree 
of depression is a normal, adaptive part of 
the process of disengagement from incen- 
tives. Since perpetual depression would 
leave the organism incapacitated, there is 
obvious need for a recovery phase. 

Adaptive functions for population survival. 
In some important respects, depression is in- 
dividually maladaptive. Not only are de- 
pressed individuals often hypochondriacal 
(Jacobs, Fogelson, & Charles, 1968), the be- 
reaved also often experience actual deteri- 
orating health and higher rates of morbidity 
(Marris, 1958) and mortality (Rees & Lut- 
kins, 1967; Young, Benjamin, & Wallis, 
1963). Life changes that require major 
social readjustments increase the risk of 
subsequent serious illnesses, especially of 
chronic illness (Holmes & Masuda, 1973). 
Most of the seven most heavily weighted 
events in Holmes and Masuda's index of life 
changes are known to produce a high inci- 
dence of depression, and most involve losses 
of incentives or threats of loss: death of 
spouse, divorce, marital separation, jail 
term, death of close family member, per- 
sonal injury or illness, and marriage. 

In the face of these grim statistics, how 
can the depression phase of the cycle be 
adaptive? Viewed from a population stand- 
point, there are several advantages to the 
demise of depressed individuals. First, se- 
vere depression is a consequence of losing 
a major incentive, which in a natural setting 
is likely to involve loss of a mate, progeny, 
or territory. Insofar as the loss is related 
to relative incompetency on the part of the 
individual, there is a eugenic advantage in 
eliminating the incompetents. Insofar as the 
individual has been prevented from repro- 
ducing while still laying claim to a share of 
the food supply, there is a reproductive ad- 
vantage in his demise. Thus, a characteris- 
tic disadvantageous to certain individuals 
may benefit the species and may therefore 
be selected for in the course of evolution. 

Price (1972) has suggested two other in- 
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triguing arguments for the biological ultility 
of depression. First, if incentive loss can 
be attributed not to incompetence but to an 
inhospitable environment, it would be useful 
for an individual to respond as he might to 
other instances of environmental inhospi- 
tality by minimizing the metabolic cost of 
waiting for easier times. Price draws a 
parallel between "stay-put" depressions and 
hibernation, based in part on several be- 
havioral similarities (including, of course, 
torpidity) and a few biochemical indications. 

'The second argument involves the obser- 
vation that a number of primate species have 
adaptively ritualized competition for domi- 
nance, reducing potentially lethal battles to 
symbolic displays. However, if the loser of 
such a symbolic contest 


has merely received signals, he is physically fit 
and might choose to ignore the ritual contest and 
engage in destructive fighting, or he might claim 
that the ritual was null and void in some way 
and insist on repeating it. This sort of behavior 
would counteract all the gains to be derived from 
ritualized fighting; in fact, the ritualization of 
fighting requires that the loser should yield. . . . 
The simplest and most effective way that he can 
be induced to yield is by being incapacitated. It 
does not matter whether he is physically or psy- 
chologically incapacitated, except that in the latter 
case recovery is swifter and more complete. (Price, 
1972, p. 135) 


Research by Gartlan & Brain (1968) and 
by Price (1972) has shown that the vervet 
monkey undergoes marked physiological and 
behavioral changes following a loss in a 
competition for social dominance, including 
depressed-appearing behavior and increased 
mortality. 

Depression could, then, serve a number 
of adaptive functions for individuals and for 
species, and it may have evolved as a com- 
promise mechanism serving several of them. 


FURTHER [PLICATIONS FOR PSYCHOLOGICAL 
THEORY AND APPLICATION 

The concept of an incentive-disengage- 
ment cycle and related concepts articulate 
with a wide network of theory and practice. 
Their implications, therefore, extend. in a 
number of different directions. The impli- 
cations for motivational theory have already 
been suggested above. The present section 
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summarizes implications for theory and 
treatment of depression, for the psychology 
of drug use, and for other social problem 
areas, 


Implications for a Theory of Depression 


Clinical status. The formulation pre- 
sented above suggests that in most cases de- 
pression is a normal, adaptive, nonpatho- 
logical process which, despite its being a 
nuisance, need cause little concern for the 
psychological viability of the depressed indi- 
vidual. If one were to draw a physical anal- 
Ogy, it is more akin to a painful bruise than 
to a broken leg or to varicose veins. In se- 
vere cases depression, like a bruise, might 
call for clinical support, and it also carries 
an increased risk of precipitating a more 
serious pathology. Nevertheless, the de- 
pressive process as such can be seen as es- 
sentially part of recuperating from a loss. 

This is not to deny the reality of disabling 
depressions. There are, no doubt, learned 
dispositions or constitutional factors that in- 
crease the severity of the depressive phase 
of the cycle or impair the person’s ability to 
recover. Depressions that occur in conjunc- 
tion with various other processes such as 
endocrine dysfunctions and Schizophrenia 
require special treatment approaches. Never- 
theless, the etiological view presented here 
Suggests particular treatment strategies for 
"normal" depression, involving educational 
and supportive help to provide the person 
with a time perspective, to assist in an anal- 
ysis of possibly maladaptive goal commit- 
ments, and to reduce anxiety, shame, etc., 
regarding the depression itself, 

The present view raises à question as to 
whether a depressive phase not only accom- 
panies disengagement from incentives but 
also is necessary for successful disengage- 
ment from incentives to occur. It suggests 
the importance of discovering to what extent 
various pharmacological and electroconvul- 
Sive therapies interfere with the patient's 
disengagement from incentives, Remedies 
that interfere with disengagement may have 
either desirable or undesirable effects, such 
as sending back into contention Someone 
Who is too easily discouraged or reviving a 
commitment to a goal that were better aban- 


doned. In either case, a physical therapy 
unaccompanied by other help may forestall 
a person's examining the nature of his com- 
mitments, which may be necessary for last- 
ing therapeutic effect. 

Explanation of depressive symptoms, The 
formulation presented here provides direct 
explanations for at least four prominent ac- 
companiments of depression: hopelessness, 
lowered self-esteem, links to aggression and 
hostility, and hypochondria. Since depres- 
sion is viewed specifically here as disengage- 
ment from an incentive that seems hope- 
lessly out of reach, the affect of hopelessness 
and the cognitive belief in one’s helplessness 
can be seen as a generalization from the 
specific loss. If the value of the lost incen- 
tive becomes exaggerated as suggested 
above, the person’s loss would appear to 
him more extensive than the original im- 
portance of the incentive warrants, espe- 
cially if his sense of self-worth was predi- 
cated on effectiveness in controlling incen- 
tives. 

The proposed model makes clear that both 
aggression and depression are products of 
frustration, but neither is viewed as a trans- 
formation of the other, 

Finally, the concept of the cycle together 
with the theory of motivational effects on 
ideation explains the frequent finding of hy- 
pochondriacal symptoms in conjunction with 
depression (e.g., Jacobs et al., 1968). With 
the marked weakening of current concerns 
about external incentives, one would expect 
a relative increase in attention to intero- 
ceptive stimuli, which would then elicit 
a larger share of fantasy segments con- 
cerning them. Along with the preoccupa- 
tion with bodily sensations, it is easy to see 
how the anxiety that often accompanies de- 
pression. (Claghorn, 1970; Grinker et al., 
1961) and the secondary gains that accom- 
pany the sick role would combine to produce 
a high incidence of hypochondria, 

Mood stability and instability, Most peo- 
ple go about their lives with few major mood 
shifts, their affect varying within a tolerable 
range from the “blues” to mild elation, de- 
Spite occasional disappointments. Clinically 
depressed patients are often reported to have 
experienced a cluster of disturbing events in 
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preceding months (e.g., Leff et al., 1970). 
The concept of an incentive-disengagement 
cycle can accommodate these observations 
with the assumption that a person can ex- 
perience a separate cycle for each incentive 
to which he is committed and that the ef- 
fects of the different cycles can summate at 
each moment. If, therefore, a person who 
has suffered a serious reverse has many 
other irons in the fire and perhaps some- 
thing going well for him, the model would 
predict a dampening of elation about the 
things going well rather than an experi- 
enced depression. On the other hand, if 
the depressive phases of several cycles were 
to coincide in time, the model would predict 
some degree of depression even if none of 
the incentives is, by itself, of paramount im- 
portance, The model thus provides a frame- 
work for understanding fluctuations in 
mood. 

Individual differences. The effects of the 
incentive-disengagement cycle probably de- 
pend on individual differences in constitu- 
tion (Rosenthal, 1970), cognition, and other 
consequences of learning histories. The 
concept of the incentive-disengagement cycle 
provides a framework for analyzing the di- 
mensions and sources of individual differ- 
ences in susceptibility to depression. The 
five "phases" of the cycle each suggests dif- 
ferent sources, For instance, the invigora- 
tion phase suggests that the onset of the 
aggressive phase may be hastened by posses- 
sion of a limited response repertory or the 
phase may be truncated by interpretations 
that lead to early hopelessness such as cer- 
tain kinds of causal attributions. The nature 
of the aggression phase suggests that his- 


„tories of reward or punishment for aggres- 


sive behavior affect its duration since they 
would affect the amount of unrewarded ef- 
fort necessary to crush hope. The two 


phases jointly suggest a search for factors 


that make people differentially extinguish- 
able. For instance, Amsel (1972) has re- 
cently reported research on extinguishability 
and persistence as a function of partial re- 
inforcement training in rats. Dweck and 
Repucci (1973) have found that children 
who attribute success or failure to their own 
efforts are less likely to succumb to “learned 


helplessness.” Under extinction conditions, 
subjects who attribute their failure to low 
ability or task difficulty become much more 
pessimistic about their chances of success 
than subjects who attribute their failure to 
lack of effort or bad luck (Weiner, Heck- 
hausen, Meyer, & Cook, 1972). The de- 
pressive downswing phase suggests a search 
for constitutional factors and also for factors 
involving number and variety of a person's 
simultaneous commitments to incentives, 
since it seems likely that a downswing will 
be less severe if the individual has many 
other promising commitments. Finally, the 
recovery phase suggests a search for factors 
that help the depressed individual find sub- 
stitute incentives and gratifications. 

Inner experience during bereavement and 
depression. “The most characteristic feature 
of grief" writes Parkes (1972), “is not 
prolonged depression but acute and epi- 
sodic ‘pangs’... . At such a time the lost 
person is strongly missed and the survivor 
sobs or cries aloud for him" (p. 39). Fur- 
thermore, bereaved people frequently find 
themselves thinking thoughts of a kind that 
had become habitual (such as whether a 
mate will like something the bereaved per- 
son is doing) but which are now inappro- 
priate (Marris, 1958). When the theory 
of depression presented here is combined 
with the theory of inner experience detailed 
elsewhere (Klinger, 1971, Note 1, Note 2), 
the larger theoretical structure can account 
for these phenomena. First, bereavement 
falls within a period of disengagement dur- 
ing which the person still retains a current 
concern about the lost individual. Describ- 
ing both clinical and ethological evidence, 
Parkes (1972) concludes that "[pining] is 
the subjective and emotional component of 
the urge to search for a lost object" (p. 40). 
Second, however, segments of thought about 
a concern, along with their associated affect, 
seem to be induced when the person en- 
counters cues (both sensory and ideational) 
related to those concerns. Third, the seg- 
ments that then unfold are integrated 
response sequences established — through 
previous overlearning. Thus, within this 
framework, segments that occur as if the 
lost person were still alive can be under- 
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stood as the perservation of integrated se- 
quences induced by a still-alive concern and 
customary cues. Similarly, pangs of grief 
can be viewed as emotionally laden segments 
induced in the same way as other segments 
of thought. 

Since depression carries with it indiffer- 
ence to most incentives, it involves a weak- 
ening of most current concerns. If current 
concerns govern responsiveness to cues and 
govern the content of fantasy, the model 
predicts that the severely depressed indi- 
vidual will be relatively unresponsive to 
most stimuli since most stimuli are irrele- 
vant to him, even though sensory imagery 
may make up a large share of his conscious 
experience. In addition to a drastically re- 
duced level of operant activity, the depressed 
person's weaker affective responsiveness to 
most external stimuli should lead to his en- 
gaging in more repetitive, stale, drab fan- 
tasy, much of it perhaps cued off by somatic 
sensations and bodily concerns. The model 
thus provides some specific testable hypothe- 
Ses regarding depressive inner experience, 
Since in accordance wth the larger model 
these factors similarly influence operant 
thought, the model also can generate specific 
predictions regarding the impairment of the 
depressed person's ability to cope with the 
exigencies of his life situation. 

The theory of depression as learned help- 
lessness. Recently, Seligman (1974) pro- 
posed a theory of reactive depression as 
"learned helplessness,” The theory grew 
out of observations that animals subjected 
to inescapable electric shocks in one situa- 
tion fa Pavlovian hammock) were slower 
to initiate escape Tesponses in a second situa- 
tion (a shuttlebox) and slower to learn es- 
cape responses after beginning to emit them 
(Overmeir & Seligman, 1967; Seligman & 
Groves, 1970; Seligman & Maier, 1967). 
Other studies have similarly found that suf- 
fering inescapable discomfort retards escape 
learning in humans (Hiroto, 1971, Racin- 
skas, 1971), although one study (Thornton 
& Jacobs, 1971) found no significant differ- 
ences between an inescapably shocked group 
and a nonshocked control group. Seligman 
explains the subjects’ impaired escape learn- 
ing as arising from learning that the aver- 


sive stimulation is noncontingent on their 
responses—that they are helpless to prevent 
discomfort. Presumably, learned helpless- 
ness is, in this view, the critical event that 
triggers reactive depression. Like depressed 
individuals, organisms subjected to inescap- 
able stress are passive and apparently pessi- 
mistic. Both have experienced uncontroll- 
able stress, There is some evidence that 
both conditions are accompanied by norepi- 
nephrine depletion, Primarily on the basis 
of these classes of evidence, Seligman pro- 
poses "learned helplessness" as a model of 
reactive depression. 

Seligman's position and the present theory 
agree that both extinction and inescapable 
Stress are depressing. Seligman’s evidence 
rests primarily on the effects of inescapable 
Stress, but his theory considers the operative 
factor to be lack of control over reinforcers, 
Which includes extinction of appetitive be- 
havior. The theory of the incentive-disen- 
gagement cycle rests on evidence concerning 
the effects of appetitive extinction, failure, 
and incentive losses, but these categories 
logically include losses of comfort and fail- 
ures to ensure safety, Seligman | stresses 
that the sense of control over events is itself 
valued. The theory of current concerns 
Stresses the organizing effect of contingent 
incentives on behavior and on the content 
of thought and affect. Thus, both positions 
agree concerning the importance of an in- 
centive's controllability, although the reasons 
for regarding it as important seem to differ. 

Apart from these similarities, there are at 
least three areas of disagreement. First, is 
the passivity of a depressed organism due to 
its having learned it is helpless or is it an 
unlearned accompaniment of a more com- 
prehensive state? Second, is that state pre- 
cipitated by helplessness as such or by the 
loss of incentives? Third, can teaching a 
Person confidence in his powers to control 
events ("mastery training") reduce the de- 
Pression he experiences following losses? 
These are important issues that bear some 
discussion, 

Concerning the first of these issues, there 
are at least two kinds of observations that 
seem to pose problems for the view that A 
passivity of depression is learned directly. 
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First, depression afflicts many people who, 
despite their loss of some significant incen- 
tive, had been succeeding in their pursuit 
of one or more other significant incentives. 
Thus, the person who loses a mate but re- 
mains successful in his or her work, or who 
loses a job while remaining on good terms 
with loved ones, may become depressed de- 
spite evidence of his or ability to control 
various incentives. Why should his sense 
of helplessness in one restricted life sector 
generalize to so many other sectors? 

Even more difficult to accommodate 
within the framework of learned helpless- 
ness, the recovery phase of the incentive- 
disengagement cycle has been observed in 
rats that were still being subjected to con- 
tinuing extinction trials (Klinger et al., 
1974; see Figure 1). If their depressed 
levels of open-field activity are consequences 
of learned helplessness, why did the de- 
pression dissipate while the lesson of their 
helplessness continued—that is, while their 
runway speeds continued to fall? 

The second major issue concerns the pre- 
cipitating circumstances of depression: help- 
lessness or the loss of incentives. One ex- 
periment with human subjects (Roth & 
Bootzin, 1974) produced helplessness by 
providing random feedback in a concept 
learning task without using aversive stimu- 
lation. "These "helpless" subjects later were 
more likely than controls to take steps 
against equipment malfunctions. 

The learned-helplessness position assumes 
that helplessness (i.e, noncontingency of 
outcomes on behavior) is inherently aver- 
sive. However, such a proposition appears 
hard to defend. Whether noncontingency 
is aversive clearly depends on what is in- 
volved. Pigeons and rats are willing to 
work for periods of noncontingent reward 
(Neuringer, 1973), and among humans 
commercial gambling finds many devotees. 

A third major issue concerns the effec- 
tiveness of a preventive approach suggested 
by the learned-helplessness position, the no- 
tion that "mastery training" in childhood 
can "immunize" people against depression 
later. Animals with previous successful ex- 
periences at escaping stress, who are then 
subjected to inescapable stress, later relearn 


escape, when that becomes possible again, 
more easily than animals whose inescapable 
stress constitutes their first experimental ex- 
perience. Experience with partial rein- 
forcement schedules can also increase rats’ 
persistence during extinction (Amsel, 1972). 
However, there is, so far, no indication that 
people can be immunized against depression. 
(Why does success not immunize against 
success depressions?) The incentive-dis- 
engagement theory does agree that early 
mastery training may reduce the frequency 
of depression by teaching people to persist 
longer during the invigoration and aggres- 
sion phases of the cycle, thereby preventing 
premature discouragement and failures by 
default. However, nothing in the incentive- 
disengagement position suggests that mas- 
tery training would reduce the severity of 
depression when losses occur. Therefore, 
this viewpoint suggests that it may be de- 
sirable to educate people concerning the 
course, inevitability, and naturalness of de- 
pressive reactions, thereby perhaps alleviat- 
ing some of the secondary consequences of 
depression, such as anxiety and self-doubts 
aroused by the fact of depression having 
occurred. 

If the incentive-disengagement theory of 
depression is correct, how can we explain 
the results of inescapable stress? It may be 
that these reflect the operation of two fac- 
tors. First, subjects in these experiments 
may indeed learn the uselessness of respond- 
ing, just as the learned-helplessness theory 
suggests. This, then, would account for 
the subject’s subsequent passivity in the 
face of opportunities to escape. Second, 
however, the removal of such incentives as 
comfort and safety may also produce a state 
of depression, which may then account for 
some of the more general effects, similar to 
those of other incentive loss. In other 
words, the two-factor explanation suggests 
that incentive loss produces depression. 
Those situations in which the organism 1s 
unable to escape stress additionally produce 
specific learned helplessness. 


Implications for Drug Use 


The incentive-disengagement cycle con- 
sists of a succession of feeling states, most 
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of them unpleasant. Psychoactive drugs 
provide ways to alter feeling states. Al- 
though some effects of drugs seem to be 
producible by any of a number of substances 
—for instance, drug-produced "highs" seem 
to have some common elements— different 
varieties of drugs exert different specific ef- 
fects as well. For instance, marijuana en- 
hances the incentive value of almost all sen- 
sory experiences (McKenzie, Note 3; Tart, 
1971). Alcohol in at least moderate dos- 
ages confers an increased sense of optimism 
and "power" (McClelland, Davis, Kalin, & 
Wanner, 1972), as do the amphetamines, 
Which additionally reduce fatigue, depres- 
sion, and boredom (Cole, 1967). Marlatt 
(Note 4) reports that “almost a third of all 
relapses [of alcoholic patients] occurred 
when the patient was frustrated or angry ;” 
and he reports experimental findings that 
heavy drinkers drink heavily after they have 
been insulted with no opportunity to retali- 
ate, but they drink little if they had an op- 
portunity to retaliate. Scattered interview 
evidence suggests that among long-term 
users heroin has a normalizing effect on 
mood, anesthetizing the user to the effects 
of living with few major positive incentives 
and numerous negative ones (Cortina, 
1970). 

The possibility therefore arises that the 
specific effects of various drug classes may 
be differentially rewarding in different 
phases of the incentive-disengagement cycle. 
In that case, the concept of the cycle may 
help to account for differential patterns of 
use. For instance, one might suppose that 
marijuana would appeal especially to peo- 
ple who seek to enhance incentive values 
perhaps because they are deprived of ap- 
pealing incentives, because they have become 
too adapted to the incentives available to 
them, or for recreational purposes. Simi- 
larly, alcohol may be especially preferred 
during the latter portions of the invigoration 
or aggression stages, perhaps especially 
when the individual fears failure or impo- 
tence due to his own personal limitations, 
Heroin may become a drug of choice in 
desperate situations barren of hope for posi- 
tive incentives and rich in negative ones; 
and so on, 


This is not, of course, to lose sight of 
the multiple factors that determine drug 
use, such as the general pleasure of a "high," 
cultural factors governing use, availability, 
costs, ritual, and beliefs in self-exploratory 
or other cognitive and experiental benefits. 
Nevertheless, the conception of the incen- 
tive-disengagement cycle suggests a power- 
ful array of factors for the analysis of dif- 
ferential drug preferences and use patterns. 


Other Implications 


The conceptualization described above 
can be applied to a number of other prob- 
lems. For instance, changes in age status 
often force people to give up satisfying ac- 
tivities and enlivening aspirations. Changes 
in the value of old incentives, such as jobs 
and marriage partners, may turn an incen- 
tive-rich life situation into a barren one. 
Although the consequences of these rela- 
tively gradual changes in incentive value are 
not discussed above, they may have similar 
effects as losses, including depression (Cos- 
tello, 1972), Chronically barren life situa- 
tions probably produce rather different out- 
comes, but here, too, in view of the organi- 
zational properties of current concerns, the 
present theory can contribute predictions. 

The theoretical position advanced here 
also carries a number of implications for 
education, psychotherapy, and eupsychian 
management. It suggests that one of the 
functions of education for mental health 
should be to teach the nature and ramifica- 
tions of commitments to goals, including 
both the virtues and the risks of aspiring to 
difficult goals. Such education would place 
the experience of depression into a compre- 
hensible framework that is neither alarmist 
nor dismissing but that treats depression as 
an important kind of information about one’s 
life situation from which one can deduce 
new plans for self-fulfillment, Psychotherapy 
for depression would incorporate these func- 
tions and would additionally aid patients 1n 
working out coping strategies suitable to 
their situation, Finally, the present theory 
Suggests managerial techniques to improve 
employees' functioning and job satisfaction. 


- W kk lll 
— — CP. RR 


INCENTIVE RELATIONSHIPS No. 


CONCLUSIONS 


There is compelling evidence that behav- 
ior is organized around the enjoyment and 
pursuit of incentives. Between the time an 
organism commits itself to striving for an 
incentive and the time it consummates or 
abandons the incentive it is in a special state 
with respect to the incentive, a state called 
here a "current concern." During states of 
current concern, organisms are more respon- 
sive to concern-related stimuli than they 
would otherwise be to those stimuli, and 
they respond with elements of the con- 
cern-related responses in their repertory. 
Therefore, current concerns potentiate con- 
cern-related content in fantasy. Loss or im- 
poverishment of a person’s incentives can 
therefore further be predicted to reduce ac- 
tivity level, flatten or depress affect, reduce 
the variety and interest of fantasy, and pro- 
mote st;mulus-bound and especially somatic 
fantasy. 

When organisms are blocked in their pur- 
suit of an incentive, they go through an 
orderly sequence of changes called here the 
incentive-disengagement cycle, whose phases 
(which are not discrete) include invigora- 
tion of behavior, aggression, downswing 
into depression, depression, and recovery. 
The content of the cycle in a particular in- 
stance depends in predictable ways on the 
particular circumstances, and the severity of 
affect depends on the importance of the in- 
centive, The cycle corresponds to the 
course of extinction. Depression is ordi- 
narily therefore a normal, adaptive part of 
disengaging oneself from an incentive. 

The concept of the incentive-disengage- 
ment cycle suggests places to look for in- 
dividual differences in susceptibility to de- 
pression and can account for depressive 
symptoms such as hopelessness, hostility, 
lowered self-esteem, and hypochondria. The 
particular kinds of unpleasant fantasy and 
affect experienced during the incentive-dis- 
engagement cycle may mesh with the specific 
effects of psychoactive drugs to help ex- 
plain the use of marijuana, alcohol, am- 
phetamines, narcotics, and probably other 
“mind-altering” substances, 

The effective value of an incentive changes 
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with changes in the expectation that it can 
be attained, with habituation, changes in 
adaptation level, and experience with the in- 
centive class. These factors may operate to 
impoverish a person’s incentive world and 
to affect the course of careers, marriage, 
and aging. 

The incentive-disengagement theory of de- 
pression carries specific implications for psy- 
chotherapy, education, and eupsychian man- 
agement, 
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The experience of alienation is conceptualized as a sequential-developmental 


process which (a) develops in the 
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Although the term “alienation” has been 
associated with numerous connotations and 
used to describe a variety of phenomena, it 
nonetheless evokes specific images in rela- 
tion to contemporary American culture, The 
ghetto riots and student demonstrations of 
the sixties, the hippie movement, the drug 
culture, and the recent rash of politica] kid- 
nappings—all of these phenomena come to 
mind as manifestations of alienation ; and all 


Seem to involve segments of the population 
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context of an ongoing relationship be- 
tween an individual and another person or group of people 


; (b) involves 


n in the quality of outcomes provided to the in- 
and (c) persists to the extent that the individual 


psychologically proximal. On the 


ts, personal-neutral thwarting and salience- 
of estrangement experiences is pro- 


syndromes: isolation, reintegration, 


Though in each of these syndromes the indi- 


uation leads to at least temporary disillusion- 
ment, the induction of persisting cognitive 


changes as a result of one’s ex- 


Personal thwarting by another is viewed to be the central and 
unique feature of the alienation experience 


Finally, research directions 


implications of the proposed framework are discussed. 


Despite the fact that alienation represents 
a pervasive theme in American culture and 
connotes common meanings to many people, 
our understanding of the determinants and 
manifestations of alienation remains quite 
limited. Numerous conceptualizations of 
alienation have been proposed from histori- 
cal, philosophical, sociological, and psycho- 
logical perspectives ; yet, very little empirical 
research concerning the phenomenology of 
alienation heretofore has been conducted. 

The lack of such research seems largely 
attributable to the nature of previous theo- 
retical perspectives. In general, alienation 
has been conceptualized as a static phenome- 
non rather than a dynamic one. Historical 
and sociological analyses (e.g., Bell, 1960; 
Blauner, 1964; Durkheim, 1897/1951 ; Marx, 
1844/1964 . Merton, 1938; Simmel, 1902/ 
1950; Weber, 1958) typically focus upon 
Social-structural factors which appear to be 
linked to conditions of widespread aliena- 
tion within society; for example, auto- 
mation, anonymity, bureaucratization, and 
economic inequality, Psychological and 
philosophical analyses (e.g, Camus, 1956; 
Keniston, 1965. Kierkegaard, 1959; Sartre, 
953; Seeman, 1959) place a greater €m- 
phasis upon the individual's experience and 
expression of alienation, but these concepttt- 
alizations also appear to be static ones; 
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alienation is viewed either as a personality 
disposition or an a priori condition of hu- 
man existence. 

The conceptualization proposed here em- 
phasizes the experiential dimensions of ali- 
enation but extends earlier psychological 
approaches in attempting to identify and 
operationalize the determinants and core dy- 
namic of this experience. The analysis in- 
corporates three fundamental components: 
(a) a set of antecedent conditions, deriv- 
ing from one’s physical/social environment, 
which engenders (b) a specific psychological 
experience, having motivational overtones, 


and expressed as (c) a set of behav- 
ioral manifestations. Alienation is viewed, 
then, as a sequential-developmental process 
through which certain social and physical 
conditions within a particular environment 
evoke specific feelings and behaviors in its 
occupants, 

The present discussion of alienation is 
focused primarily at the small-group level. 
Considering that previous analyses generally 
have been developed within a broader social 
context and, thus, have emphasized the re- 
lationship between sociocultural conditions 
and alienation, the small-group setting ap- 
pears to be an atypical one in which to ex- 
amine the phenomenon of alienation. The 
microcosm of the small group, however, of- 
fers an advantageous context in which to 
study alienation for it permits a dynamic, 
situational analysis and thereby facilitates 
the development of predictive theory from 
which experimental hypotheses can be de- 
rived and tested. An individual's relation- 
ship with society is of an almost abstract 
nature and certainly of great duration. By 
contrast, his/her interpersonal relationships 
which involve specific individuals are often 
initiated and terminated within a relatively 
short period of time, Therefore, it becomes 
more feasible to examine the onset and un- 
folding of one's alienation from other people 
rather than from society, since the temporal 
stages of alienation are more readily de- 
tected and delimited at the former level of 
analysis than at the latter, 

An important assumption underlying the 
Proposed theoretical framework is that there 


are intrinsic similarities between interper- 
sonal and sociocultural alienation, and that 
an investigation of the former has relevance 
for an understanding of the latter. This 
assumption in no way implies that the ex- 
perience of alienation as it occurs within 
small-group settings is fully equivalent with 
the varieties of sociopolitical alienation dis- 
cussed by Marx (1844/1964), Durkheim 
(1897/1951), and Merton (1938). Rather, 
it suggests that the psychological experience 
of alienation, in general, involves basic pro- 
cess features which are reflected at both in- 
terpersonal and societal levels of analysis. 

Specifically, it is assumed that an individ- 
ual's alienation develops within the context 
of an ongoing relationship between himself 
and some other entity—a person, group, so- 
ciety, or culture, for instance. The experi- 
ence of alienation is brought about through 
a decline in the quality of one’s relationship 
with a particular context, and this perceived 
deterioration evokes dissatisfaction with the 
present situation and a yearning for some- 
thing better which has been either lost or, 
as yet, unattained. 

The dynamic of disillusionment, suggested 
above, seems evident in earlier conceptual- 
izations of alienation, In the Marxian anal- 
ysis, the attainment of class consciousness 
among proletarian groups reflects a histori- 
cal process in which objective conditions of 
exploitation are apprehended by certain seg- 
ments of society and amended through pro- 
test and revolution. Similarly, Merton’s 


1In the writings of Marx, alienation is discussed 
from both philosophical and ideological perspec- 
tives, In a philosophical context, alienation is 
understood in terms of social and historical forces 
which eventuate in the objectification of labor 
(i.e, the spiritual and physical separation of labor 
from its products; cf. Marx, 1844/1964). From 
an ideological perspective, conscious recognition of 
the objectification and exploitation of labor pro- 
vides the basis on which class consciousness is at- 
tained and the forces of revolution are mobilized 
(cf. Marx & Engels, 1848). It is the ideological 
dimension of alienation which appears to imply a 
disillusionment process, though it should be noted 
that the Marxian analysis pertains to collective 
consciousness whereas the present one focuses upon 
the experience of the individual. Tt seems reason- 
able to assume that the attainment of class con- 
sciousness presupposes the shared estrangement of 
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portrayal of American society in terms of its 
"structural inconsistency” depicts a situa- 
tion in which disadvantaged groups resort 
to violence as a means of achieving a more 
equitable distribution of material goods. 
Moreover, within philosophical and psycho- 
logical analyses of alienation, a disillusion- 
ment process is reflected in the existential- 
ist’s feeling of abandonment, discussed by 
Sartre (1953), and the experiences of “Pyr- 
rhic victory" and 
alienated college st 
Keniston (1965), 

The present analysis, 
the assumption that the experience of al- 
lenation essentially involves a disillusion- 
ment process, From this perspective, Thi- 
baut and Kelley's (1959) comparison level 
theory is particularly useful as a conceptual 
base from which to move toward an op- 
erational model of alienation. Comparison 
level denotes a criterion of outcome accepta- 
bility with which an individual evaluates the 
attractiveness of a situation in terms of what 
he expects or feels he deserves. Assuming 
that alienation involves a disconfirmation of 
expectations regarding the quality of a re- 
lationship, the comparison level construct is 
germane to a consideration of the circum- 
stances under which disillusionment occurs 
and the behavioral concomitants of this ex- 
perience, 

While previous cone 
ienation share certain 
similarities are rea 
analysis reflects 
with either a soci 
social context of 


“loss of Eden” among 
udents, as described by 


then, begins with 


eptualizations of al- 
continuities, their dis- 
dily apparent, First, each 
a characteristic concern 
etal/cultural or personal/ 
alienation, Historical and 
sociological perspectives typically focus on 
the relations among groups within society, 
whereas psychological and philosophical ap- 
proaches generally examine the individual's 
experience in relation to other persons, 

A second dissimilarity among earlier ana]. 
yses concerns the type of disillusionment 
and behavioral response suggested by each. 
The proletarian revolutions Predicted by 


individual workers from the capitalist system of 
Production, Reference to Marx in this discus 
though, is in no way intended to equate sociologi- 
cal and Psychological notions of alienation, but 
merely to suggest a possible parallel between ele- 
ments of disillusionment in both, 


sion, 
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Marx (1844/1964), and the structural in- 
consistency discussed by Merton (1938), 
exemplify situations in which segments of 
Society come to recognize their inferior 
Or disadvantaged position vis-à-vis other 
groups. To the extent that such conditions 
aré prolonged and the sources of inequity 
are identifiable, the estrangement of the 
downtrodden tends to intensify and provoke 
angry protest. In comparison with the anal- 
yses of Marx, Durkheim, and Merton, those 
of Keniston, Sartre, Simmel, and Weber 
focus upon a qualitatively different set of 
alienating social conditions. The alienation 
discussed by the latter group appears to be 
more passive and less hostile in nature and 
results from a generalized dissatisfaction 
with one's milieu. The sources of such dis- 
satisfaction are abstract and impersonal 
rather than specific and intentional, 

To account for the divergent patterns of 
alienation suggested in previous discussions, 
the present analysis develops an extension 
of comparison level theory which incorpo- 
rates the dimensions of personal-neutral 
thwarting and salience-nonsalience of alter- 
natives. The interaction of these. compo- 
nents suggests a typology of behavioral pat- 
terns which seems consistent with earlier 
analyses. 


It should be emphasized that the theo- 
retical perspective developed in the ensuing 
paragraphs is concerned specifically with 
interpersonal Phenomena. The concepts in- 
troduced, by Providing a possible link be- 
tween the Psychological experience of indi- 
viduals and their collective action in re- 
Sponse to mutually alienating conditions, 
may prove to be applicable to an analysis 
of intergroup relations—the traditional con- 
cern of sociological perspectives on aliena- 
tion. Explicit consideration of the interface 
between interpersonal and intergroup phe- 


nomena, however, is beyond the purview of 
this discussion. 


A THEORETICAL FRAMEWORK ror THE 
TUDY OF ALIENATION IN THE 
SMALL Group 


The analysis begins at a general level and 
subsequently turns to a more specific con- 
Sideration of the Small-group situation. The 
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initial stages of the analysis provide a re- 
capitulation of certain ideas introduced ear- 
lier, whereas the latter stages develop those 
ideas more fully in terms of social psycho- 
logical theory. 

As noted above, the proposed conceptuali- 
zation of alienation includes three basic com- 
ponents: a constellation of antecedent con- 
ditions, a specific psychological experience, 
and a set of behavioral manifestations. Each 
component includes a unique set of dimen- 
sions. It is useful to focus first upon the 
psychological experience of alienation since 
an analysis of this component leads quite 
naturally to a consideration of its anteced- 
ents and consequences. 

The experience of alienation. Alienation 
can be viewed as a form of dissatisfaction 
resulting from one’s perceived association 
with a negatively valued activity, person, 
group, or culture. This preliminary defini- 
tion suggests two important aspects of the 
alienation experience. First, it develops with 
reference to some particular context in 
which a person, P, feels embedded and to- 
ward which he feels negative sentiment. The 
spatial scale of the context may vary along 
a continuum ranging from the personal, to 
the social and cultural levels. Second, the 
experience of alienation persists over time 
and remains salient to P as long as he feels 
constrained to the undesirable context. As 
P becomes aware of his simultaneous prox- 
imity to, yet dislike of, a particular con- 
textual referent, his recognition of this dis- 
crepancy will lead to frustration. The per- 
sistence of frustration, arising from one's 
Constraint to an unsatisfactory context, 
should result in specific motivational over- 
tones—for example, the desire to dissociate 
oneself from the context, the inclination to 
ae alternatives, or tendency to in- 

Itis ID or object. B i 

hat the above definition does 

nort permit a clear distinction between alien- 
ation and other PSychological phenomena. 
The conditions Specified would be applica- 
ble, for example, to the general experience 
Of dissatisfaction as described by Thibaut 
and Kelley (1959), Psychological reactance 
i inp by Brehm (1966), or cognitive 
alance as portrayed by Heider (1958). 
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In order to distinguish alienation from other 
theoretical constructs, its spatial and tem- 
poral dimensions must be considered more 
fully. 

The antecedents of alienation. Within the 
personal, social, and cultural contexts of al- 
ienation, a common configuration of ele- 
ments is found: (a) an object, O, with 
which P feels associated; (b) a type of 
thwarting, with respect to a given set of 
outcomes, having a particular source, direc- 
tion, and either high or low intentionality ; 
the thwarting, whether emanating from O or 
not, leads to either a temporary or a perma- 
nent devaluation of O by P; and (c) a set 
of forces which constrains P to his associa- 
tion with O; these forces may result di- 
rectly from O's actions toward P, or through 
the unavailability of better alternative rela- 
tionships. 

The object of P’s alienation, within a per- 
sonal context, may be represented as an ac- 
tivity in which P is engaged. The worker, 
for example, becomes alienated from his 
work as a result of its monotony and im- 
personality, Within a cultural context, P 
may perceive himself to be associated with 
a particular country through formal citizen- 
ship yet extremely dissatisfied with certain 
aspects of citizenship, such as having to 
serve in the army and participate in an un- 
just war. Finally, in the context of a small- 
group situation, P’s alienation may derive 
from an inequitable distribution of oppor- 
tunities for participation, or certain pre- 
interaction constraints (e.g, group size) 
which impair his integration into the deci- 
sion-making process. Although P is un- 
happy with his present situation, he is 
forced, for lack of a better alternative, to 
remain associated with or proximal to the 
group. 

In terms of Heider’s (1946, 1958) bal- 
ance theory, each of the above examples re- 
fects a similar configuration of circum- 
stances in which P's unit and sentiment 
relations. with a particular object, O, are 
discrepant. According to Heider, P's rela- 
tionship with O is balanced to the degree 
that his perceived linkage with O coincides 


with favorable feelings toward O (or con- 


versely, P's removal from O is consistent 
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Ficure 1. Sentiment/unit combinations and the 
sequential patterns of alienation. (The inclusion 
of P, an individual, in the square denotes a unit 
relation, whereas P’s presence outside the box 
designates the absence of one. The plus or minus 
sign within the square denotes P's sentiment to- 
ward the context or object, O. The arrows rep- 
resent the developmental paths leading to the con- 
figuration of alienation.) 


With a negative evaluation of O). Tt fol- 
lows, then, that in the three examples dis- 
cussed above, a state of imbalance between 
P and O exists. Yet, the characterization 
of alienation as a simple state of imbalance 
pon the specific ways in 
which the experience develops, the param- 
eters of its intensity, or the form of its 
manifestation. To gain some understanding 
of these issues, the temporal, or develop- 
mental, nature of alienation must be ex- 
plored further. 

Considering P's relationship to O, four 
combinations of sentiment/unit relations are 
possible: 


1. positive sentiment/positive unit, 
2, negative sentiment/negative unit, 
3. positive sentiment/negative unit, 
4. negative sentiment/positive unit, 
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In Figure 1, each of these combinations hs 
represented in diagrammatic fashion. In 
the present analysis, it is proposed that the 
fourth possibility, namely, the negative sen- 
timent/positive unit combination, describes 
the pattern of circumstances reflected in a 
situation of alienation, Furthermore, each 
of the other three combinations denotes an 
initial state of affairs from which the con- 
figuration of alienation may be reached. 
Three developmental patterns or paths, 
leading to the conditions of alienation, ed 
posited: (a) “disillusionment” or — 
(b) "disenchantment," and (c) "confirme 
estrangement.” The path traveled depends 
upon which of the three initial sets of senti- 
ment/unit relations precedes the configura- 
tion of alienation ( —sent/-- unit). ] 

Each of the four possible sentiment/unit 
combinations can be characterized with re- 
Spect to two dimensions, namely, the de- 
gree of P's involvement in his relationship 
with O, and P's expectation of either posi- 
tive or negative outcomes from his asso- 
ciation with O. In the context of this dis- 
cussion, the dimension of involvement refers 
specifically to those variables which concern 
the amount and immediacy of P's experience 
with O (eg. the duration of the P-O re- 
lationship, or P's spatial proximity to O), 
rather than P's subjective orientation to- 
ward O (eg., his commitment or enthusi- 
asm toward O). Expectation, on the other 
hand, relates to the quality and stability of 
outcomes anticipated by P on the basis of 
his experience with O. From the above, it 
follows that close and continuous involve- 
ment with O could eventuate in either posi- 
tive or negative expectations, depending 
upon the quality of P’s cumulative experi- 
ence with O, Moreover, little or no direct 
contact with O might set the stage for either 
Positive or negative expectations, depending 
upon the information p receives about O 
through others, 


In the +sent/tunit situation, P's in- 
volvement and expectations are both high. 
This pattern ig reflected, for example, in the 
case where one has been an active member 
of a group and has experienced consistently 
favorable outcomes throughout the term 0 


| 


wo? 
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his membership. The +sent/—unit situa- 
tion reflects low involvement but high ex- 
pectations on the part of P regarding his 
potential relationship with O. The case in 
which a freshman attends the rush of a 
highly desirable fraternity is an example of 
this pattern. The —sent/—unit situation 
involves low involvement and low expec- 
tations; P is neither related to O at pres- 
ent nor does he wish to become associated 
with O in the future. Finally, the —sent/ 
--unit pattern of alienation reflects high in- 
volvement and low expectancies for favora- 
ble outcomes, on the basis of P's present 
dissatisfaction with O. 

It is apparent that the dimensions of in- 
volvement and expectation provide a basis 
for considering the determinants of the in- 
tensity of alienation and the form of its 
manifestation over.time. From an exten- 
sion of Thibaut and Kelley's (1959) com- 
parison level theory, it is expected that P 
will experience the greatest degree of com- 
parison level discrepancy when his experi- 
ence of alienation arises through the process 
of disillusionment or reversal. For, it is 
along this path that P begins with high in- 
volvement and expectations only to arrive 
at, or be jolted into, an extremely unfavor- 
able situation. To determine why this turn 
of events sets the stage for the most intense 
experience of alienation, the relationship be- 
tween the dimensions of involvement and 
expectations must be considered more fully. 

Any situation in which P has experienced 
consistently favorable outcomes from O 
should promote the development of P's com- 
mitment to O (cf. Horwitz, 1954; Thibaut, 
Friedland, & Walker, 1974). That is, P 
comes to “believe in" O as a reliable source 
of positive outcomes, Somewhat related to 
ae Proposition js Jones and Gerard's 
(1367) concept of “basic antinomy" which 


5 H - H a i 
posits that an individual's attributions about 
a particular entity 


3 are most amenable to 
change during the early stages of exposure 
= the entity but become progressively more 
inflexible as the duration of exposure or in- 
teraction increases, Thus, as a favorable 
relationship continues, P should become in- 


Creasi " i > 
asingly commited to O and resistant to 


any information which might disconfirm his 
belief in O. However, once the evidence 
of O's suddén inadequacy or betrayal ex- 
ceeds P's "threshold of disconfirmation," the 
resulting dissolution of P's commitment and 
expectations should induce a level of dis- 
illusionment whose intensity is magnified by 
the extent of P's prior involvement with O. 

In the process of disenchantment, P be- 
gins with high expectations but with a mini- 
mal amount of previous exposure and com- 
mitment to O; and subsequently finds him- 
self in a worse-than-expected relationship. 
The freshman who has been invited to join 
the initially desirable fraternity realizes his 
mistake and entertains the thought of de- 
pledging. However, as yet he feels unable 
to do so because of certain social pressures. 
"This situation provides the basis for a mod- 
erate degree of comparison level discrepancy 
and accompanying alienation, since the re- 
versal of P's expectations does not also in- 
volve an obliteration of earlier commitment 
to O. 

The pattern of confirmed estrangement 
creates the lowest level of comparison level 
discrepancy for P. He begins with minimal 
expectations and prior involvement, only to 
have his predictions confirmed through un- 
avoidable association with O. Consider, for 
example, the black student living in a pre- 
dominantly white school district. He is 
forced by his residence to attend a white 
school. If he expects, prior to his enroll- 
ment, to encounter racism and prejudice at 
the school and later finds his predictions to 
be accurate, he will experience alienation. 
However, it is expected that the intensity of 
his experience will be less than what it 
could have been had he entered the school 
anticipating favorable outcomes. 

The involvement and expectation dimen- 
sions offer only a partial basis for predicting 
the intensity of alienation. Other factors 
mav interact with these dimensions to mod- 
ify ‘their impact on P. Specifically, more in- 
formation is required concerning the factors 
which instigate the processes of disillu- 
sionment, disenchantment, and confirmed 
estrangement. A consideration of those 


thwartings which lead to a deterioration in 
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Ficure 2. Neutral and personal processes of 
disillusionment. (The inclusion of P, an individ- 
ual, in the square denotes a unit relation whereas 
the presence of P outside the box designates the 
absence of one. The plus or minus sign within 
the square denotes P’s sentiment toward the ob- 
ject or context, O. The arrows represent the de- 


velopmental paths leading to the configuration of 
alienation.) 


P's relations with 
needed information, 

The remaining discussion will focus upon 
the process of disillusionment for two rea- 
, it is necessary to assume that 
a relationship exists between P and O in 
order to examine the impact of various 
types of thwarting on that relationship. This 
assumption is possible only with respect to 
the disillusionment pattern of alienation, in 
which a formerly satisfying relationship 
with O (even one which only recently has 
been established) turns sour. Second, the 
reversal of P’s feelings toward O from com- 
mitment to disillusionment represents the 
most typical characterization of alienation, 
as described in the historical, sociological, 
Philosophical, and psychological literature, 

As stated earlier, P’s experience of aliena- 
tion arises from a specific thwarting which 
Strains his relationship with O, A thwart- 
ing may be characterized w 
three basic dimensions : 


O should provide the 


ith respect to 
(a) its source, (b) 
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direction, and (c) intentionality. The first 
refers to the origin of the force; that is, 
whether it emanates from O or from the 
extraneous environment. The second re- 
lates to the target of the force; for example, 
whether or not it is directed specifically at 
P. And the third concerns P's attribution 
of intentionality to the source of the force. 
These dimensions may be combined in : 
Variety of ways to yield different types o 
thwartings, ; 

The psychological impact of a particular 
thwarting will be mediated by many pe 
textual variables. A highly directed anc 
intentional thwarting, for instance, could be 
interpreted differently by P depending upon 
the information he has concerning the norms 
or situational pressures which explain O's 
actions. When O's actions are interpreted 
as reasonable in view of the particular situ- 
ation (e.g, when it becomes necessary for 
P to accept an undesirable role for the good 
of the group), less dissatisfaction and al- 
ienation should be aroused in P than when 
O's actions appear to be arbitrary and un- 
justified (e.g. P is unfairly chastised or in- 
sulted by O). In a similar vein, an un- 
directed and unintentional thwarting might 
arouse divergent emotions in P depending 
upon his prior experience with O. To the 
extent that the relationship has been un- 
favorable in the past, P may assimilate even 
an unintentional thwarting to the charac- 
teristic deficiencies he ascribes to O. 

Two types of disillusionment processes 
are represented in Figure 2, Each results 
from a different kind of thwarting. The 
first case involves a neutral thwarting—that 
is, one which does not emanate directly from 
O, is not specifically directed at P, and is 
Perceived by P as being unintentional. Spa- 
tial Constraints, information overload, and 
large group size represent neutral thwart- 
ings at the small-group level, Each con- 
straint hinders P's relationship with O in 
some sense, Assuming, for example, that 
P desires to participate in group discussion 
and decision making, this need may be 
thwarted by environmental circumstances 
which make it difficult for him to interact 
meaningfully with the group. Lack of avail- 
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able space impairs the coordination of P’s 
activity with that of other group members; 
and the presence of too many members re- 
duces P’s opportunity to participate in group 
decision making. : 

The second type of disillusionment pro- 
cess involves a personal thwarting—that is, 
one which stems directly from O, is specifi- 
cally directed at P, and is perceived by P 
as reflecting O’s intentions. A hostile or 
inequitable action represents an example of 
a personal thwarting. The intentional rele- 
gation of P to tedious and low-status roles 
is likely to be perceived by him as unfair 
discrimination, exclusion, and rejection by 
the group. 

In view of the differences between the 
neutral and personal patterns, it is expected 
that P's disillusionment will be more bitter 
and irreversible in the latter case than in 
the former one, In both processes, P ex- 
periences a reversal in his feelings from high 
commitment and expectations to unantici- 
pated frustration. As suggested earlier, 
such reversal results in comparison level 
discrepancy and puts a strain on P’s com- 
mitment to O. In the personal pattern, 
though, the source of P’s frustration is more 
specific, and its intentions more visible and 
malevolent. The sudden communication of 
unnecessary insult or injury by O intro- 
duces the element of “rejection” into P's 
experience of disillusionment. Thus, the 
“gain-loss” element (Aronson & Linder, 
1965) arising from P’s rejection by O, when 
coupled with the frustration of P's expec- 
tancies regarding favorable outcomes, should 
Promote a more intense and irreversible ex- 
Perience of alienation than when comparison 
level discrepancy operates alone. Moreover, 
rejection, by definition, implies a low cor- 
Cj Dondenoe between the outcomes of P and 
^ fus & Kelley, 1959) ; as well, there 

2 “icy on the part of P to reciprocate 
O's rejection (Berscheid & Walster, 1969). 
Hence, the Possibilities for salvaging a satis- 
factory relationship with O are minimal, and 
the probability that P Will dissolve his com- 
mitment to O is high, 


i In the neutral process of disillusionment, 
Jowever, the source of P’s frustration is 


vague and less personal in nature. Because 
there is no ready source to which P can 
impute foul play, the commonality of inter- 
est between P and O is greater than in the 
personal case. And because there is more 
basis for cooperation in the neutral case 
than in the personal one, P will be more 
amenable to an eventual reconciliation with 
O in the former instance, Thus, although 
a neutral thwarting may at first seem un- 
justifiable to P in view of O’s previous re- 
liability as a source of Positive outcomes, 
the strain in P’s commitment to O should 
soon dissipate as long as evidence of O's 
malevolent intent continues to be lacking. 

The term, “thwarting,” has been inter- 
preted in the present analysis as a force 
which provokes a reversal in the quality of 
P’s relationship with some object, O. The 
reversal has been characterized as a progres- 
sion from a satisfactory state of affairs, en- 
dorsed by P, to an unsatisfactory situation 
involving his disillusionment. Hence, the 
term thwarting, as used above, refers to a 
type of force which provokes P's experience 
of alienation and partially determines the 
intensity of that experience. 

In order to arrive at a more adequate rep- 
resentation of the alienation experience, 
however, we must consider the impact of a 
particular thwarting in the context of those 
“restraining forces” which enforce P’s con- 
tinued association with O once the reversal 
process has occurred. For, without P's 
prolonged proximity to or association with 
the object of his disillusionment, there would 
be no basis for his experience of alienation. 
He would merely dissociate himself from the 
alienating context, and its aversive salience 
would thereby decrease. 

The —sent/4-unit configuration of aliena- 
tion, thus, reflects a “nonvoluntary” rela- 
tionship (Thibaut & Kelley, 1959). In 
such situations, P is forced, over time, to 
remain at an infra-comparison level state of 
outcome quality. A variety of restraining 
forces may operate to enforce P's associa- 
tion with the object of his alienation; physi- 
cal pressures emanating directly from O, or 
P's lack of more desirable alternative rela- 


tionships, for example. In the small-group 
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situation, P may be forced by social pres- 
sures against "copping out," to continue his 
membership with some group; or he may 
decide to continue his association with the 
group ior want of a better alternative. In 
either case, P perceives that desirable al- 
ternative relationships are unattainable and 
consequently remains in an unsatisf 
situation, 

The behavioral manifestations of aliena- 
tion. The behavioral component of aliena- 
tion can be considered in terms of its form 
and its effects. The form of response to 
felt alienation can be characterized as ag- 
gressive or nonaggressive, active or passive, 
and directed or nondirected. The hositility, 
intensity, and directionality of a person’s be- 
havior would convey the quality and strength 
of his feelings, Regarding its effects, be- 
havior can be viewed as either adaptive or 
maladaptive; that is, as an action which 


alleviates or intensifies P’s experience of 
alienation. 


It is proposed that P's behavior will be 
most active, directed, and aggressive in re- 
sponse to a personal thwarting, where the 
source of one's frustration is specific and 
intentional. Furthermore, the gain-loss ele- 
ment provokes a clear conflict of interest be- 
tween P and O; hence, minimal opportunity 
exists for salvaging the damaged relation- 
Ship. This Situation, then, which elicits 
frustration. and provides cues for hostility 
(manifested in O's rejection of P), as wel] 
aS a specific target for P's counteraction, is 
highly conducive to aggressive behavior (cf. 
Berkowitz, 1965; Buss, 1961), 

The neutral pattern of reversal, by con- 
trast, provides a greater range of behavioral 
options for P, Since the event leading to 
P's frustration is unintentional and its 
Source vague, a basis for cooperation be- 
tween P and O exists. Thus, P's response 
to a neutral thwarting will be of à problem- 
Solving nature, His behavior may assume 
a variety of forms over time. In response 
to Spatial constraints, for example, P may 
Initially attempt to improve the coordination 
of his activity with that of the group, If 

his efforts are ineffective, he may withdraw 
temporarily into passive isolation, Pro- 


actory 
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lead to some form of hostility or permanent 
isolation. . 

The form of P's response to a particular 
thwarting will also be influenced by the sa- 
lience of desirable alternative relationships 
The present conceptualization presupposes 
the existence of a positive unit relation be- 
tween P and O. That is, P perceives him- 
self to be linked to O through spatial prox- 
imity or the expectation of future proximity. 
Moreover, this perceived linkage is unpleas- 
ant and nonvoluntary to the degree that P 

| 


longed frustration, however, may eventually 


finds O unattractive and presently feels un- 
able to improve their relationship or estab- 
lish an alternative one. Although for the 
time being he is temporarily and unwill- 
ingly constrained to O, it is quite possible 
that he may expect to achieve a more satis- 
factory relationship in the near future. Re- 
gardless of whether viable alternatives are 
actually available to P, the anticipation of 
an improved relationship with O or an 
available substitute should markedly affect 
the intensity and overt aggressiveness of P's 
Tesponse to either a person 
thwarting. 


P's behavior may be viewed in the context V, 
of four situational configurations : 


l. neutral thwarting/perceived lack of | 
alternatives, | 


. neutral thwarting/salient alterna- | 
tives, | 

. personal thwarting/perceived lack of 
alternatives, | 


personal thwarting /salient alterna- ! 
tives. | 


al or neutral 


2 


The combinations are ordered with respect 
to the degree of frustration each evokes 1” 
P. The perception of available alternatives 
in the context of a neutral thwarting shoul 
engender the least amount of frustration”: 
whereas a personal thwarting, in the face ° 
salient alternatives, would evoke the mo 
Hence, the instigation to aggression show, | 
be Strongest in the latter situation i 
weakest in the former, E 1 
Four ideal types of response, to the V / 


Big : sug? 
ous combinations of restraints, are 
gested: | 
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l. neutral thwarting/per- 
ceived lack of alternatives — isolation 
neutral thwarting/sali- 
ent alternatives —————— reintegration 
personal thwarting/per- 
ceived lack of alternatives — subjugation 
4. personal thwarting/sali- 

ent alternatives ——— — —— rebellion 


N 


i) 


Assuming that P is, for the time being, 
constrained to an unsatisfactory relationship 
with O, either by force or current lack of al- 
ternative situations, what are the behavioral 
options open to P in each of the above con- 
figurations? Thibaut and Kelley (1959) 
propose two means by which an individual 
can reduce his frustration in a nonvoluntary 
relationship : (a) lower the comparison level, 
or (b) find some way to achieve the pres- 
ently unattained level of outcome quality. 
Each behavior represents an adaptive re- 
sponse to felt alienation. 

The lowering of one’s standards regard- 
ing the quality of outcomes he deserves 
would be most likely to occur in situations 
where desirable alternative relationships 
were perceived to be unattainable. In con- 
trast, continued search for a way of meeting 
current standards of outcome quality would 
be more characteristic of situations in which 
favorable alternatives were viewed as po- 
tentially available. 

To predict the degree to which P's be- 
havior will be adaptive in the context of 
either salience or nonsalience of alternatives, 
it is necessary to consider further the be- 
havioral implications of personal and neu- 
tral thwartings. Where alternatives are 


perceived to be unavailable, a personal 
t more difficult for 


thwarting would make i 
ry out- 


P to accept presently unsatisfactory € 
comes than would a neutral thwarting, since 
the rejection and arbitrary insult conveyed 
by the former would frustrate P’s hope for 
a future improvement in his relations with 
O. Thus, the frustration resulting from 
subjugation to an unreformable situation 
would be greater than that stemming from 
temporary isolation in a situation which may 
Subsequently improve. 

_ Similarly, in situations where more de- 
Sirable alternatives are salient, P's potential 
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for effective adaptation would be greater in 
the context of a neutral thwarting, as com- 
pared to a personal one. Given that P is 
currently constrained to O, his range of 
options in dealing with an unintentional 
thwarting by O would be greater than those 
associated with a personal rejection by O. 
The first situation offers P the possibility 
of deferring judgment on O and actively 
attempting to improve the present relation- 
ship, before endeavoring to reach external 
alternatives. A personal thwarting by O, 
however, reduces P's opportunities for re- 
integration and increases his potential for 
frustration resulting from an unrelievable 
discrepancy between his negative feelings to- 
ward, yet proximity to, O. Hence, until 
imposed proximity with O was somehow re- 
duced, the likelihood of P's rebellion against 
O in the form of overt protest and aggres- 
sion would be great. 

Recent research literature on social psy- 
chological stress (cf. Lazarus, 1966; Levine 
& Scotch, 1970; McGrath, 1970) would 
suggest that when P is unable to alleviate 
his experience of alienation, and this ex- 
perience extends over a prolonged period of 
time, two general syndromes of stress may 
ensue: psychophysiological stress as re- 
flected in certain physical disorders (eg 
Wirth, 1938), and self-destructive or anti- 
social behavior as manifested in P's self- 
disparagement or aggression toward others 
(e.g., Durkheim, 1897/1951 ; Merton, 1938). 


Differentiation of Alienation from Related 
Psychological Constructs 


Having discussed the situational, experi- 
ential, and behavioral components of aliena- 
tion in greater detail, it is now essential to 
specify the dissimilarities between alienation 
and other psychological phenomena. An 
important question in this context is whether 
or not the proposed conceptualization of al- 
ienation is so broad as to embrace any form 
of disaffection. An excessively general 
framework would shed little light on the 
distinctive aspects of alienation. 

The phenomenology of alienation, as de- 
scribed earlier, does include elements of dis- 
like, dissatisfaction, imbalance, frustration, 
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and reactance. As P becomes disillusioned 
with O, he develops a dislike of O and feels 
dissatisfied with the inferior (infra-compari- 
son level) quality of his present outcomes. 
Moreover, P's embeddedness in a negatively 
valued context should promote cognitive im- 
balance and frustration, as well as feelings 
of preemption and infringement which might 
be characterized as reactance. 

Although the experience of alienation sub- 
sumes the above-mentioned elements, it can 
be distinguished from them on the basis of 
its etiology, Psychological impact, and be- 
havioral Consequences. According to the 
present analysis, P's alienation results from 
a deterioration in his formerly satisfactory 
relationship with O, Moreover, P expects 
to remain proximal to O even after the re- 
versal has occurred. This developmental 
pattern does not apply necessarily to feelings 
of dislike, dissatisfaction, imbalance, or re- 
actance. P’s dislike of or dissatisfaction 
with O does not presuppose an originally 
pleasant relationship between himself and O. 
In the case where P is blindly prejudiced or 
has extremely high expectations, for exam- 
ple, he may adopt a negative or skeptical 
orientation toward O, even before meeting 
him. And the constraint to remain in a 
negatively valued situation is not a neces- 
Sary condition for the 
Reactance can occur 
to do what he w 


arousal of reactance. 
even when P is forced 
ants or say what he wants 
to say (Worchel & Brehm, 1970), 

The present analysis of alienation relies 
heavily on the distinction between personal 
and neutral thwartings. While differences 
between these constructs were discussed ear- 
lier, it is important here to emphasize their 
similarities and uniqueness in relation. to 
Other types of thwartings for purposes of 
conceptual clarification, 


tual clai The notion of 
thwarting implies the arousal of frustration 
in P resulting from 


an interference in the 
gratification of a previously aroused motive 
(Cofer & Appley, 1964; Dollard, Doob, 
Miller, Mowrer, & Sears, 1939). The con. 
text in which frustration occurs 


can vary 
from transitory encounters to ongoing re- 
lationships, Both personal and neutral 


thwartings, as defined here, however, occur 
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within a previously pleasant ano 
and, as such, both Promote some degree 0 
disillusionment in P with regard to O. P's 
disillusionment derives from his prior com- 
mitment to O and the unexpected but sud- 
denly apparent possibility that his commit- 
ment may no longer be tenable, 

The unexpectedness of personal and neu- 
tral thwartings, in view of O's previous re- 
liability as a source of positive outcomes, 
should lead P to perceive either type of re- 
versal as unjustifiable. The unjustifiable 
aspect of these thwartings renders them 
conceptually similar to Pastore's (1952) no- 
tion of arbitrary frustration, though the 
latter construct has typically been confined 
to personally, rather than impersonally, de- 
livered thwartings (cf, Burnstein & Worchel, 
1962; Cohen, 1955; Rothaus & Worchel, 
1960). While both types of thwartings 
should arouse frustration, the personal pro- 
cess should initiate more hositility due to 
the evidence of O's malevolence and the in- 
sult this conveys to P (cf. Buss, 1961). 
Thus, personal and neutral thwartings can 
be identified, among other types of thwart- 
ings, as involving an element of perceived 
unjustifiability, and in the personal case, as 
implying O's insult of P. 

The present emphasis on personal and 
neutral thwartings is based upon widespread 
reference to the element of disillusionment, 
above and beyond frustration and hostility, 
found in most discussions of alienation, As 
noted earlier, disillusionment involves a 
strain in P's commitment to O, Whether 
Or not this strain leads to a dissolution of 
commitment depends upon P's assessment 
of the future viability of his relationship 
with O. 

The outcome of P’s appraisal will depend 
largely on the presence or absence of cues 
Which signal the future unreliability of O 
and an unresolvable "lack of fit" between 
the attributes, interests, and goals of O and 
P. In this regard, Etzioni (1968) charac" 
terized alienation as a social situation wh!¢ 
is "beyond the contro] of the actor and wie 
Tesponsive to his basic needs” (p. 879). It 
Seems reasonable to assume that the clearest 
forewarning of O's continuing unresponsive- 
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ness or malevolence would be communicated 
by personal thwartings which pose a threat 
to P's physical safety, ideology, or selí- 
concept, and imply an irreducible conflict of 
interest between him and O. On the other 
hand, the least amount of information sug- 
gesting O's future unreliability would be 
provided by neutral thwartings which are 
basically unintentional, nondirected, and 
minimally threatening to P’s security. 

In situations of neutral thwarting, the 
ambiguity of O’s future orientation would 
necessitate the adoption of an exploratory 
set on P’s part. That is, he would tend to 
withhold judgment regarding the viability of 
his relationship with O until additional, rele- 
vant information was obtained. P’s search 
for more information might take the form 
of active protest, quiet observation, or a pat- 
tern involving both elements of protest and 
contemplation. At any rate, as long as P 
remained in an exploratory set, his disillu- 
sionment with O would not eventuate in a 
final dissolution of commitment to O. 

Under conditions of personal thwarting, 
however, the jeopardization of P's safety or 
self-concept would provide clearcut evidence 
of O's potential unreliability. Such evi- 
dence should ultimately provoke P’s decision 
that the psychological distance between him- 
self and O is insurmountable. Once this 
decision has been reached, P’s commitment 
to O should be withdrawn, his perceptions 
of O altered, and in some cases, his self- 
concept modified (as reflected, for example, 
in P’s generalized reluctance to approach 
others similar to O). These cognitive 
changes in P are likely to be accompanied 
by some form of angry retaliation against O 
for his original insult of P. Because of the 
cen: undesirability of any material 
Reg previously provided by O, "E 
(Buse, imer d remain eae 
qu wt the. or expressive in nature, an 
mend in) one of its minimal reinforce- 

jt value) should dissipate subsequent to 
an initial display of hostility. 

The foregoing discussion provides not 
leap ayes Of personal and neutral 
: ng ne Context of frustration- 
gression theory but also a sketch of the 
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distinctive experiential features of the alien- 
ation syndrome. Three basic stages of the 
alienation experience can now be summa- 
rized: (a) the sense of disillusionment re- 
sulting from an unexpected thwarting which 
strains P’s commitment to O; (b) a post- 
thwarting process of appraisal by which P 
evaluates the future viability of his relation- 
ship with O on the basis of circumstantial 
information—this stage may involve instru- 
mental aggression by P aimed at clarifying 
or rectifying the unsatisfactory situation; 
and (c) P's decision that his relationship 
with O is no longer tenable—this decision 
will result in irreversible cognitive changes 
in P which may be manifested through spo- 
radic bursts of aggression. 

Whereas stages of disillusionment and ap- 
praisal are associated with neutral as well 
as personal thwartings, the induction of ir- 
reversible cognitive changes in P is viewed 
in this analysis as the unique and central 
feature of the alienation experience. All in- 
stances of alienation would seem to involve 
the general perception that one’s self-image 
or identity is highly discrepant with others’ 
perception of him, This realization could 
provide the foundation for any number of 
attitudinal changes regarding others (ie., 
“They are unfair" or "irrelevant") as well 
as oneself (“I don't need them,” "I don't 
enjoy the activities they do." “My values 
are basically different from theirs," etc.). 
To the extent that P's disillusionment with 
O effects attitudinal changes along several 
interrelated dimensions, P's alienation from 
O should become increasingly irreversible 
over time. > 

Referring to alienation as an irreversible 
syndrome is not to imply that cognitive 
changes associated with disillusionment are 
totally resistant to modification. Indeed, 
reliable evidence of O's postthwarting re- 
forms may well promote a rapprochement 
between P and O at some time in the fu- 
ture. Rather, the description of alienation, 
as irreversible is meant to distinguish its 
phenomenological features from those, of 
other psychological phenomena which are 
relatively more transitory in nature (e.g. 
momentary frustration or dissatisfaction). 
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The main contention here, then, is that the 
cognitive changes associated with P's es- 
trangement from O extend (both in a quali- 
tative and temporal sense) beyond feelings 
of frustration, imbalance, dissatisfaction, and 
dislike, and are not predicted on the basis of 
theoretical constructs pertaining 
feelings. 

The relative irreversibility of the aliena- 
tion experience is perhaps its most distinc- 
tive quality vis-à-vis similar psychological 
phenomena. Many theories describe 
tive mechanisms which enable P 
his relationshi 
entity. 
theory 


to these 


adap- 
to reconcile 
D with an initially unfavorable 
From the perspective of balance 
(Heider, 1958), for instance, P's 
Perception of a unit relation between himself 
and O should promote positive sentiment 
toward O over time. Similarly, cognitive 
consistency theory posits processes of denial, 
differentiation, and transcendence (Abel- 
son, 1959) which operate to resolve the dis- 
crepancies in P's sentiment and unit rela- 
tions with O. "Thibaut and Kelley (1959), 
in their discussion of nonvoluntary relation- 
ships, suggest that P may reduce his dis- 
satisfaction with O by lowering the com- 
parison level. And Dollard et al. (1939) 
propose that frustration can be reduced 
through cathartic aggression. The present 
analysis, however, contends that P’s aliena- 
tion, when brought about through a personal 
thwarting by O, will tend to remain un- 
resolved as long as he remains proximal to 
O, even after the objective frustrations (e.g., 
loss of status, salary) associated with the 
original thwarting have been removed. Only 
by withdrawing from or obliterating his re- 
lationship with O can he reduce the weight 
and salience of his acknowledged separate- 
ness, unengagement, and estrangement, 

As indicated earlier, the potential for P’s 
reconciliation with O will be considerably 
greater when his alienation results from a 
neutral thwarting. In view of t 
rary and reversible nature of 
alienation, it is perhaps more fruitful to 
characterize this experience as a tr 
state of disaffection rather than a 
alienation, 
affection is q 


he tempo- 
“neutral” 


ansitory 
form of 
In certain respects, such dis- 


ualitatively closer to the experi- 
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ence. of psychological reactance (Brel 
1966) than to that of alienation. According 
to Brehm, reactance is evoked in P when 
his freedom is directly or implicitly elimt- 
nated and prompts efforts on his part to 
reestablish the lost or threatened freedom. 
In the present context, P’s experience © 
a neutral thwarting provokes dissatisfaction 
with O, largely because he is no longer e 
to enjoy the rewarding outcomes which 
previously emanated from O. He is moti- 
vated, though, to explore possibilities for 
improving his relationship with O NE 
means of regaining his freedom to enjoy 
satisfactory outcomes. 

When P's alienation results from a per- 
sonal thwarting, however, the importance of 
receiving positive reinforcement from O is 
diminished. Despite the absence of favora- 
ble alternative relationships, P is presuma- 
bly more willing to surrender his freedom 
to enjoy rewarding outcomes than to at- 
tempt a satisfactory reconciliation with O. 
Thus, although the experience of “personal” 
alienation involves feelings of reactance and 
the motivation to recover the quality of one’s 
outcomes, the specific freedom preempted by 
O may be devalued and eventually replaced 
by the “freedom to be one’s own man,” 
This substitution of freedoms 
for the behavior of 
withdraw from or 


May account 
alienated individuals who 
rebel against O, rather 
than attempting to salvage the relationship. 


RESEARCH DIRECTIONS AND Poricv 
ĪMPLICATIONS or THE PROPOSED 
FRAMEWORK 


The conceptualization of alienation pre- 
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oping a predictive theory wh 
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perience, and behavioral expression of dis- 
illusionment, two dimensions were derived 
which permit a delineation of the spatial and 
temporal features of the alienation experi- 
ence: personal-neutral thwarting and sali- 
ence-nonsalience of alternatives. On the 
basis of these variables, alienation was con- 
ceptualized not only as a particular psycho- 
logical state or configuration of relations 
between O and P but also as a chronological 
process involving a pattern of emotional re- 
actions and behaviors. 

A typology of behavioral patterns based 
upon the thwarting and alternative dimen- 
sions was proposed which included four 
ideal types of response: isolation, reintegra- 
tion, subjugation, and rebellion. Each pat- 
tern was examined in terms of three experi- 
ential stages: a postthwarting period of 
disillusionment, a process of appraisal, and 
eventual reaffirmation or dissolution of com- 
mitment. The levels of estrangement and 
aggression associated with these stages pro- 
vided a basis for distinguishing among quali- 
tatively different syndromes of response to 
alienating conditions, 

Specific predictions pertaining to the be- 
havioral impact of the thwarting and alter- 
natives dimensions can now be advanced: 

_1. The intensity and duration of an indi- 
vidual's estrangement will be greater under 
conditions of personal thwarting, than under 
those of neutral thwarting. 

2. A personally thwarted individual will 
experience more intense and prolonged hos- 
tility than one who has been  neutrally 
thwarted. 

_ 3. (a) During the early stages of dis- 
ilusionment, individuals for whom alterna- 
tive relationships are salient should exhibit 
alte, EBression than those who perceive 
quent appr, to be unavailable. (b) Subse- 
should pit of a thwarting as neutral 

a Miehen uate in reduced aggression and 

H ate of reintegrative behavior, es- 
pecially among persons f 
: i S for whom alterna- 
tives are salient. (c) A " 3 
i " ppraisal of a thwart 
ing as “personal,” howe 1d It in 
& sustained level of i ver, shou resul ir 
aice of appeton ostility and a continu- 
viduals fee da » especially among indi- 

Om alternatives are salient. 


(d) Following a decision to dissolve one's 
commitment to O, overt aggression should 
decline to the extent that prior aggressive 
actions have been ineffective in reducing 
proximity with O. 

These predictions, which assume P’s per- 
ceived or actual proximity with O, are rep- 
resented diagrammatically in Figure 3. As 
a function of the thwarting and alternatives 
components, the response variables of es- 
trangement, hostility, and overt aggression 
have been plotted over time to depict the 

behavioral profiles” of isolation, reintegra- 
tion, subjugation, and rebellion. 

The graphic portrayal of each profile sug- 
gests that any attempt to assess the validity 
of the proposed typology of behavioral syn- 
dromes would require a series of repeated 
measurements taken simultaneously along 
three different dimensions of response. 
Only a time-series approach would permit a 
qualitative distinction between temporary 
states of estrangement, as reflected in neu- 
tral thwarting conditions, and the process 
whereby personally thwarted individuals 
come to feel irrevocably alienated. 

Whereas the behavior associated with 
neutral forms of estrangement would be 
more reflective of transitory motivational 

states (e.g., frustration, reactance), the sus- 
tained hostility and estrangement accom- 
panying personal alienation would reflect 
stable dispositional and cognitive changes 1m 
P. The distinction between “motivational 
(neutral) and “attitudinal” (personal) 
forms of estrangement implies that the 
former would subside following the removal 
of certain objective thwartings (eg. B 
salary, status) while the latter would Beas 
despite subsequent overtures by ONO 
store P to his original ane a -— 
Support for the previously 88577 roce 
tions comes from a recent exped dp 
kols, 1973). In the context of a teed 
HE" » h subject worke 
Productivity Study," eac tually exe 
with three other individuals bes at usi 
perimental accomplices) on 2 p incorpo- 
-perimental sessions 
The 2-hóur vxpe shich together were 
rated three basic phases W^ 7 


versal in the sub- 
designed to create a reversal in 


ject's outcomes. 
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Ficure 3. A typology of behavioral 


of personal-neutral thwarting, 
duration of exposure to situation 
held constant). 
ment, appraisal, and decision. 

predicted responses of estrange: 


During the first phase, the subject’s inter- 
action with the group was rewarding and 
pleasant; he received positive feedback 
from the others and, in general, was led to 
believe that his performance had contrib- 
uted to the success of the group. In the 
second phase, though, he experienced a re- 
versal in the quality of his relationship with 
the group and a reduction in his earnings 
and status relative to those of the other 
group members. He was relegated to a 
mundane and inferior role while the others 
had access to interesting and enjoyable 
tasks. During the final phase, the Subject 
found himself in a role Which permitted him 
to exercise control over the other group 
members; he could choose to ignore, punish, 
or reward them, 


The personal-neutral thwarting dimension 
was manipulated through the assignment of 


salience-nonsalience of alternatives, 
(with perceived proximity to O, an object, 
(Times 1, 2, and 3 correspond to the stages of disillusion- 
The lines shown in each graph depict the 
ment, aggression, and hositility over time.) 


HIGH 


REINTEGRAT ION 


REBELLION 


patterns based upon the dimensions 
and 


the subject to an inferior role during Phase 
2 either by a group vote (personal) or random 
(neutral) process. And the salience or non- 
salience of alternatives was varied through 
the assignment of subjects on the basis of 
seli-esteem prescores (obtained prior to the 
experimental session) to either high-esteem 
or low-esteem experimental conditions. 
The main dependent measures included 
repeated assessments of hostility, overt ag- 
gression, and reported € 
number of Keniston's 
ienation Scales, 
of chronic alienat 


a 
estrangement on " 
(Note 1) Short a 
Preexperimental measure 


H ti- 
] ton and esteem were " a 
lized as covariates in the statistical analys© 


Results indicated Significantly higher levels 
of reported estrangement and hostility a 
ing Phase 3 among personally versus ner 
trally thwarted subjects yet the absence $^ 
between-groups differences in overt agg"? 


n a 
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sion during the final phase of the experi- 
ment. Perhaps most interesting were the 
changes in subjects’ attitudes not only to- 
ward other group members but also about 
themselves—for example, their position on 
certain ideological issues—as a result of 
having experienced a personal thwarting. 


Development of Conceptual Linkages 
Between Alienation and Other Social 
Psychological Phenomena 


The present analysis of alienation is rele- 
vant to a consideration of those conditions 
under which persons are willing to repudiate 
their prior commitments to individuals or 
social systems. Most theories of attitude 
change, especially the consistency theories 
of Heider (1958), Festinger (1957), and 
Bem (1967), emphasize the forces which 
prompt people to remain attitudinally con- 
sistent with their previous behavior or pub- 
lic commitments. These formulations, how- 
ever, generally have not addressed those 
Situations in which individuals refuse to 
ignore or justify the deterioration of a re- 
lationship to which they were previously 
committed, 

Aronson's (1968, 1969, 1972) restate- 
ment of dissonance theory suggests that 
high-esteem persons will feel the most need 
to justify their behavior when its conse- 
quences threaten the integrity of their self- 
concept or esteem. The present discussion, 
though, predicts that high- versus low- 
esteem people, for whom viable alternative 
relationships are characteristically salient, 
will be most likely to renounce their asso- 
Ho mith O, especially when O's actions 

? and insult their self-concept. 

Nee Interesting direction for future re- 
arch would be to explore systematicall 
the Condition o explore systematically 
lend or aol es which people will de- 
in the face of dis their earlier commitments 
ing the appropriate ming evidence regard- 
those commitments > viability, or value of 
to break off with Oo The willingness of P 
May well stem from 


Y of the situation, as con- 
passive ( ariness of O's behavior, 
(voluntar J. edlitional), rather than 

Y^ exposure of P to O's 


insult. In any event, the critical variables 
in this context can only be ascertained more 
fully through subsequent research. 

Another important direction for future 
research pertains to the manner in which as- 
pects of the physical and social environ- 
ment combine to induce stress in the occu- 
pants of a particular behavior setting. In 
the foregoing discussion, various syndromes 
of extrangement have been viewed as the 
result of either personal or neutral thwart- 
ings. Interesting extensions of this discus- 
sion relate to (a) the circumstances under 
which impersonal thwartings will be mis- 
attributed to personal sources and inten- 
tions, and (b) the processes by which per- 
sonal thwartings become associated with 
certain aspects of the physical environment, 
so as to mediate the behavioral and psycho- 
logical impact of physical stressor variables 
(e.g., noise, density, pollution). 

Assuming that ecological pressures can 
place strains on the relations among mem- 
bers of a social system, it becomes impor- 
tant to specify the conditions under which 
neutral thwartings are perceived as personal 
ones, since such misattribution is likely to 
result in a more intense, irreversible syn- 
drome of stress than that which would arise 
from a seemingly unintentional thwarting. 
Specification of these conditions may prove 
to be quite valuable in formulating | social 
planning policies amenable to a variety of 
environmental settings, for example, over- 
sized classrooms and high density neighbor- 
hoods. 

Moreover, an elucidation of the processes 

: un of the environ- 
by which physical features e 
ment acquire cue value in the d enhance 
social thwarting situations shoul i 

7 n response 

our understanding of huma e, crowd- 
environmental stress. B 2 inst chological 
ing has been characterized 79 as deman 
experience in which an 1m ble supply (Sto- 
for space exceeds the availa 


i allel can 
kols, 1972). An interesting paral Ec 
33 H a). à 
be drawn between thi 
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reducing his proximity to O. Thus, con- 
tinued exposure to personal thwartings, es- 
pecially in the context of a high density 
situation (e.g., a crowded home, classroom, 
or work environment) would heighten P's 
sensitivity to density or proximity con- 
straints and lower his resistance against 
crowding stress; the negative cue value of 
density might even extend to situations in 
which O is not present. 

Most recent analyses of human crowding 
do not suggest the circumstances under 
which specific density constraints (e.g., re- 
striction of movement, stimulus overload, 
infringement on privacy, or proximity with 
threatening others) will be more salient 
than others or the particular psychological 
and behavioral concomitants of each. An 
advantage of the personal-neutral thwarting 
dimension, though, is that it suggests a cri- 
terion for determining the intensity, dura- 
tion, and potential reducibility of various 
crowding experiences. 

On the basis of the proposed thwarting 
dimension, a distinction can be drawn be- 
tween "neutral" and "personal" experi- 
ences of crowding (Stokols, Note 2). The 
former variety of crowding would involve 
increased sensitivity to space as a result of 
the frustrations associated with neutral con- 
straints (behavioral restriction, stimulus 
overload). "The latter variety of crowding 
would stem from the security concerns made 
salient by proximity with hostile or unpre- 
dictable persons. Since feelings of crowd- 
ing in the latter case would relate to the 
individual's beliefs about the potential dan- 
gerousness of others, this type of experience 
would tend to be more intense, of greater 
duration, and more generalizable across 
various situations than would transitory ex- 
periences of impersonal density constraints. 


Policy Implications 


Previous studies have noted the relation- 
ship between alienation and a variety of 
social problems. Forward and Williams 
(1970), Gurin and Gurin (1970), and 
Ransford (1968) have discussed the asso- 
ciation between feelings of 


powerlessness 
and racial violence. 


Also, in the context of 
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contemporary concern over the population 
crisis, Groat and Neal (1967) have exam- 
ined the relation between social isolation 
(Seeman, 1959) and the motivation to bear 
children as “compensation” for feeling i50" 
lated. . 

Despite the recognized relationship bez 
tween alienation and social problems, little 
progress has been made toward the develop- 
ment of ameliorative strategies. The lack of 
such progress may be attributable to the 
static nature of previous analyses which 
place a greater emphasis on the measure- 
ment of alienation as a personality dispost- 
tion than upon an analysis of its environ- 
mental origins and process features. 

Some writers have utilized the concepts 
of powerlessness and isolation as a guide for 
developing policies aimed at alleviating al- 
ienation among the poor. Gurin and Gurin 
(1970), for example, suggest that structural 
changes in the social system, leading to in- 
creased . opportunities for experiencing suc- 
cess, provide a means of raising the expec- 
tancies of low-expectancy people. Simi- 
larly, Sarbin (1970), analyzing the “culture 
of poverty" (Lewis, 1961) in terms of role 
theory, concludes that society must create 
opportunities for the poor to realize achieved 
statuses which permit the exercise of choice, 
the experience of social praise, and the de- 
velopment of increased self-esteem. 

The above recommendations, however, ap- 
pear to be too general, undifferentiated, and, 
therefore, of limited value in approaching 
social problems related to alienation, The 
increased provision of roles associated with 
achieved statuses within the system will be 
ineffective as a means of reducing aliena- 
tion if the disaffected individuals see them- 
selves as highly competent and powerful 1 
the first place but ideologically unable t? 
Participate in any reciprocal relationshiP 
with O. Under these circumstances, it may 
be more fruitful for the System to eliminate 
cues which signal an enforced relationsh!P 
between itself and P and to facilitate 
actual or symbolic withdrawal from the X 
lationship as well as his freedom to “do hi 
own thing” outside of the system. 


di- 
The present analysis suggests that & 
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verse set of interventions may be necessary 
in dealing with social problems related to 
alienation, Whether an individual attributes 
his dissatisfaction with society to personal 
factors (such as racial discrimination) or to 
neutral circumstances (e.g., congestion, in- 
formation overload, and anonymity arising 
from high densities) will affect the intensity 
of his alienation, the manner in which it is 
expressed, and its amenability to ameliora- 
tive strategies. Similarly, the salience of 
desirable alternatives will probably influence 
his decision either to seek a reconciliation 
with the system or protest actively and per- 
haps violently against unsatisfactory social 
conditions. 

Where P has been personally thwarted, 
the most effective intervention policy from 
O's point of view would be one which pro- 
vides evidence contrary to P's attributions. 
Rather than trying to impose a forced rela- 
tionship on P, O would attempt to elimi- 
nate P's justification for attributing malevo- 
lent intent to O. The willingness of City 
Hall to facilitate community organization 
and self-government within the ghetto, for 
instance, would probably serve to reduce 
the estrangement of its residents. 

In situations where P is unable to attrib- 
ute a deterioration in his outcomes to ma- 
levolent forces, he will be more amenable to 
cooperating with O as a means of achieving 
a mutuallv satisfactory resolution of their 
joint problems. Under these conditions, P 
will be more receptive to compromise solu- 
tions initiated by O. Increased opportuni- 
ties for participation in the system, then, 
would be particularly appealing to low- 
€xpectancy individuals who previously have 
been isolated from achieved statuses. 

The above speculations are derived from 
basic assumptions concerning the phenome- 
nology of alienation, These assumptions 
have been stated throughout this discussion 
and remain to be validated more fully 
through future research at both the small- 
Sroup and sociocultural levels. 
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Sentence Comprehension: A Psycholinguistic 
Processing Model of Verification 


Patricia A. Carpenter and Marcel Adam Just 
Carnegie-Mellon University 


This paper presents a theoretical account of certain aspects of sentence com- 
prehension, particularly the processes involved in deciding if a sentence is 
true or false. The model proposes that (a) sentences are internally represented 
as an ordered set of constituents in an abstract propositional format, (b) other 
information sources (like pictures) may be represented in a similar format 
in this task, and (c) the corresponding constituents from the sentence repre- 
sentation and the picture representation are serially compared. The model 
accounts for verification latencies for a class of complex sentences that differ 
in the scope of the negation operator (e.g., It isn't true that the dots are red 
versus It's true that the dots aren't red). The predictions are made on the basis 
of a single parameter—the time to find and compare one pair of constituents. 
The model also accounts for verification of counterfactual clauses, implicit 
negatives, universal and particular quantifiers, and sentence recoding. The 
formal structure of the model seems appropriate for dealing with a broad 
domain of language processing. Since the model is expressed in terms of basic 
y mental operations and is incorporated into a more global theory of the im- 
mediate processor, it integrates sentence comprehension with the more gencral 
area of cognitive psychology. 


A major challenge to cognitive science 
has been to explain language comprehen- 
sion within the same information process- 
ing framework that characterizes other 
complex human behavior, such as memory 
scanning (cf. Sternberg, 1969a), concept 
formation (cf. Hunt, Marin, & Stone, 
1966), perception (cf. Haber & Hershen- 
son, 1973), and problem solving (cf. Newell 
& Simon, 1972). This article builds on 
the same theoretical framework to explain 
the mental processes in sentence compre- 
hension. The main focus will be on the 
ene 
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processes involved in verifying a sentence 
as true or false. The model deals with 
the internal representation of a sentence 
in short-term memory and the operations 
applied to that representation in order to 
verify it. The verification process will be 
examined in terms of elementary mental 
operations such as the comparison of two 
elements (Posner, 1969) and control proc- 
esses that govern the sequence in which 
operations are performed (Newell 1972, 
1973). The analytic methods to be used 
in constructing the model treat response 
latencies as measures of the dura 
mental processes (Sternberg, 19692, 19 al 
In the first section of the article, w ae 
develop a model of the verification p! 
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recoding model is based on operations and 
parameters similar to those in the main 
model of verification. Finally, in the third 
section, we will evaluate the generality of 
the model by examining the comprehension 
of a variety of semantic constructions, such 
as quantifiers, counterfactuals, implicit 
negatives, and active and passive sentences. 

The model is most easily explained by 
first outlining the kind of verification task 
and results that motivated its develop- 
ment. In a typical verification task (Just 
& Carpenter, 1971, Experiment 2), a person 
was first shown an array of dots, all of 
which were either red or black, and then 
a sentence like The dots are red or The 
dots aren't red. And, he was timed while 
he looked at the array, read the sentence, 
and decided if it was true or false of the 
picture. So on various trials, the sentence 
could be affirmative or negative and either 
true or false. The two main results of 
the study are easily summarized. First, 
there was an interaction between the kind 
of sentence (affirmative or negative) and 
the type of response. True affirmatives 
were verified faster than false affirmatives, 
but false negatives were verified faster 
than true negatives. The second main 
result was that negative sentences took 
longer to verify. Both of these results 
are common to many other studies (Chase 
& Clark, 1972; Clark & Chase, 1972; 
Gough, 1965, 1966; McMahon, 1963; 
Slobin, 1966; Trabasso, Rollins, & Shaugh- 
nessy, 1971; Wason & Jones, 1963). 

To explain the interaction between af- 
firmative-negative and true-false, Gough 
(1965) suggested that it is easier to com- 
pare the representation of the sentence to 
that of the picture when the two are con- 
gruent. For example, in the case of the 
affirmative sentences, the color predicate 
of the sentence matches the accompanying 

picture in the true case (e.g., The dots are 
red paired with a Picture of red dots) ; 
however, the colors mismatch in the false 
case (e.g., The dots are red paired with 
black dots), Congruence in the true af- 
firmative case explains why it js e 
Process than the false affirm 
For negative 
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in the false case (e.g., The dots aren't red 
paired with red dots) and the mismatch 
occurs in the true case (e.g, The dots 
aren't red paired with black dots). There- 
fore, congruence also explains why false 
negatives are verified faster than true ones. 
In summary, the comparison between sen- 
tence and picture is facilitated if the cor- 
responding elements match and takes 
longer if they mismatch. The extra time 
consumed by a color mismatch is called 
falsification time, and its magnitude is 
approximately the same for affirmatives 
and negatives. 

Congruence relations between the sen- 
tence and picture representations will also 
explain the second main result, why nega- 
tive sentences take longer to verify (Chase 
& Clark, 1972; Trabasso et al, 1971). 
Suppose the representation of a negative 
sentence, like The dots aren't red, contains 
a marker to denote the negative polarity 
of the predication. The negative marker 
will mismatch with the affirmative picture 
representation. This mismatch in the 
negative cases causes the comparison proc- 
ess to take longer than in affirmative cases, 
where both the sentence and picture are 
affirmative. The difference in latencies 
between negative and affirmative sentences 
is called negation time. 

In summary, the two parameters, nega- 
tion time and falsification time, account 
for the verification latencies. Both pa- 
rameters may reflect extra processing time 
caused by mismatches between correspond- 
ing components of the sentence and picture 
representations. The amount of extra 
Processing time was called falsification time 
or negation time, depending on whether 
the predicates or polarity markers 
mismatched. 

We examined the data from previous 
experiments and discovered a very per- 
sistent relation between the values of these 
two parameters, Negation time was either 
two or four times greater than falsification 
time. The discovery that negation ume 
1S an integra] multiple of falsification fie 
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operation is executed. This suggestion led 
to the following model for how the infor- 
mation in sentences is represented and 
compared to other information. 


THE CONSTITUENT COMPARISON 
MODEL 


The proposed model postulates that a 
sentence and picture are represented in 
abstract propositional formats. To verify 
the sentence, the constituents in the two 
representations are compared—tested for 
identity at an abstract level. The number 
of operations in the comparison process is 
the primary determinant of the verifica- 
tion latency. 

The internal representation. Understand- 
ing a sentence involves internally repre- 
senting the information that the sentence 
contains. It is likely that the format of 
the internal representation is propositional, 
a relational structure consisting of a predi- 
cate and one or more arguments (cf. 
Clark & Chase, 1972; Crothers, 1972; 
Kintsch, 1972; Rumelhart, Lindsay, & 
Norman, 1972; Simon, 1972). We will 
use the conventional notation, (PREDICATE, 
ARGUMENT), to denote a proposition. So, 
part of the representation of a simple de- 
clarative sentence like Zhe dots are red 
would be (RED, pots), meaning that red- 
ness is predicated of the dots. Predica- 
or negated. So, the 

of this affirmative 
[arr, (RED, pots) ]. 
A negative sentence like The dots aren't red 
would be represented as [NEG, (RED, 
pots) ]. AFF and NEG are embedding 
markers that denote the affirmative or 
negative polarity of the predication. i his 
form of representation allows us to com- 
bine simple propositions to represent 2 d 
complex sentences. For example, fu 
fortunate that the dots are red might be 
represented as (FORTUNATE, [arr, (RED, 
pots) ]J. ` 

The internal representation ol 
tence is not necessarily linguistic in nature. 
The verbal symbols in these representa- 
tions, for example, DOTS, are used to denote 
More abstract entities. In fact, research 
On sentence-picture verification suggests 
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that there may be a level of representa- 
tion that is neither linguistic nor pictorial 
in nature but can represent information 
from either domain (Chase & Clark, 1972; 
Clark & Chase, 1972). (But this does 
not imply that pictures cannot be repre- 
sented otherwise in other situations, cf. 
Cooper and Shepard, 1973). In these 
verification tasks, a propositional repre- 
sentation may appropriately characterize 
the information encoded from a picture. 
For example, a picture of red dots may 
be represented (RED, DoTS). We assume 
that pictures are generally represented af- 
firmatively, but by convention, we have 
deleted the affirmative polarity marker. 
The presence or absence of this marker 
in the picture representation has no con- 
sequences for any of our proposals, so long 
as it is assumed that the absence of a 
marker denotes affirmative polarity. The 
picture representation may be influenced 
by the sentence that precedes it. In par- 
ticular, if the picture has several possible 
interpretations, it is coded so that it has 
the same argument as the sentence repre- 
sentation (Carpenter & Just, 1972; Car- 
penter & Just, in press; Just & Carpenter, 
1971). 

The detailed form of the representation 
of various kinds of sentences has not yet 
been empirically verified. For example, 
the research on the linguistic factors that 
determine the psychological predicate— 
argument structure has only begun (cf. 
Halliday, 1967; Hornby, 1972). Moreover, 
there are cases where various representa- 
tions are formally equivalent, and the 
selection of one particular form is some- 
what arbitrary. Nevertheless, our con- 
ventional notation is sufficient for the 
current model and. promises to be flexible 
enough to accommodate a variety of lin- 
guistic structures (cf. Kintsch, 1972). 

The same sentence may be represented 
differently in different situations. This 
follows from the assumption that the rep- 
resentation contains the information that 
extracts from a sentence. What 
xtracted depends on the 
e situation in which 
1, and the listener's 
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recoding model is based on operations and 
parameters similar to those in the main 
model of verification. Finallv, in the third 
section, we will evaluate the generality of 
the model by examining the comprehension 
of a variety of semantic constructions, such 
as quantifiers, counterfactuals, implicit 
negatives, and active and passive sentences. 
The model is most easily explained by 
first outlining the kind of verification task 
and results that motivated its develop- 
ment. In a typical verification task (Just 
& Carpenter, 1971, Experiment 2), a person 
was first shown an array of dots, all of 
which were either red or black, and then 
a sentence like The dots are red or The 
dots aren't red. And, he was timed while 
he looked at the array, read the sentence, 
and decided if it was true or false of the 
picture. So on various trials, the sentence 
could be affirmative or negative and either 
true or false. The two main results of 
the study are easily summarized. First, 
there was an interaction between the kind 
of sentence (affirmative or negative) and 
the type of response. ‘True affirmatives 
were verified faster than false affirmatives, 
but false negatives were verified faster 
than true negatives. The second main 
result was that negative sentences took 
longer to verify. Both of these results 
are common to many other studies (Chase 
& Clark, 1972; Clark & Chase, 1972; 
Gough, 1965, 1966; McMahon, 1963; 
Slobin, 1966; Trabasso, Rollins, & Shaugh- 
nessy, 1971; Wason & Jones, 1963). i 

To explain the interaction between af- 
firmative-negative and true-false, Gough 
(1965) suggested that it is easier to com- 
pare the representation of the 
that of the picture when the two are con- 
gruent. . For example, in the case of the 
affirmative sentences, the color predicate 
of the sentence matches the accompanying 
picture in the true case (e.g., The dots are 
red paired with a picture of red dots); 
however, the colors mismatch in the false 
case (e.g., The dots are red paired with 
black dots). Congruence in the true 
firmative case explains why it is easier to 
process than the false affirmative Case. 
For negative sentences, the match occurs 


sentence to 


af- 
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in the false case (e.g., The dots aren't red 
paired with red dots) and the mismatch 
occurs in the true case (e.g., The dots 
aren't red paired with black dots). There- 
fore, congruence also explains why false 
negatives are verified faster than true ones. 
In summary, the comparison between sen- 
tence and picture is facilitated if the cor- 
responding elements match and takes 
longer if they mismatch. The extra time 
consumed by a color mismatch is called 
falsification time, and its magnitude is 
approximately the same for affirmatives 
and negatives. 

Congruence relations between the sen- 
tence and picture representations will also 
explain the second main result, why nega- 
tive sentences take longer to verify (Chase 
& Clark, 1972; Trabasso et al., 1971). 
Suppose the representation of a negative 
sentence, like The dots aren't red, contains 
a marker to denote the negative polarity 
of the predication. The negative marker 
will mismatch with the affirmative picture 
representation. This mismatch in the 
negative cases causes the comparison proc- 
ess to take longer than in affirmative cases, 
where both the sentence and picture are 
affirmative. The difference in latencies 
between negative and affirmative sentences 
is called negation time. 

In summary, the two parameters, nega- 
tion time and falsification time, account 
for the verification latencies. Both pa- 
rameters may reflect extra processing time 
caused by mismatches between correspond- 
ing components of the sentence and picture 
representations. The amount of extra 
processing time was called falsification time 
or negation time, depending on whether 
the predicates or polarity 
mismatched. 

We examined the data from previous 
experiments and discovered a very per- 
sistent relation between the values of these 
two parameters. Negation time was either 
two or four times greater than falsification 
time. The discovery that negation time 
35 an integral multiple of falsification ime 
suggested that are ed 
termined by ion, pu 
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operation is executed. This suggestion led 
to the following model for how the infor- 
mation in sentences is represented. and 
compared to other information. 


THe CONSTITUENT COMPARISON 
MODEL 


The proposed model postulates that a 
sentence and picture are represented in 
abstract propositional formats. To verily 
the sentence, the constituents in the two 
representations are compared—tested for 
identity at an abstract level. The number 
of operations in the comparison process is 
the primary determinant of the verifica- 
tion latency. 

The internal representation. Understand- 
ing a sentence involves internally repre- 
senting the information that the sentence 
contains. lt is likely that the format of 
the internal representation is propositional, 
a relational structure consisting of a predi- 
cate and one or more arguments (cf. 


Clark & Chase, 1972; Crothers, 1972; 
Kintsch, 1972; Rumelhart, Lindsay, & 
Norman, 1972; Simon, 1972). We will 


use the conventional notation, (PREDICATE, 
ARGUMENT), to denote a proposition. So, 
part of the representation of a simple de- 
clarative sentence like The dots are red 
would be (RED, pots), meaning that red- 
ness is predicated of the dots. Predica- 
tions can be affirmed or negated. | So, the 
entire representation of this affirmative 
sentence would be [arr, (RED, pots) ]. 


A negative sentence like The dots aren t red 
would be represented as [NEG, (RED, 
pots) | are embedding 


arr and NEG ] : 
markers that denote the affirmative 6 
lication. T his 


negative polarity of the pret s 
form of representation allows us to com 
bine simple propositions tO represent more 
complex sentences. For example, It as 
fortunate that the dots are red. might be 
represented as (FORTUNATE, [AFF, (RED, 
pots) ]). 

The internal representation of a sen- 
tence is not necessarily linguistic in nature. 
The verbal symbols in these representa- 
tions, for example, DOTS, are used to denote 
more abstract entities. In fact, research 
On sentence-picture verification suggests 


that there may be a level of representa- 
tion that is neither linguistic nor pictorial 
in nature but can represent information 
from either domain (Chase & Clark, 1972; 
Clark & Chase, 1972). (But this does 
not imply that pictures cannot be repre- 
sented otherwise in other situations, cf. 
Cooper and Shepard, 1973). In these 
verification tasks, a propositional repre- 
sentation may appropriately characterize 
the information encoded from a picture. 
For example, a picture of red dots may 
be represented (RED, pors). We assume 
that pictures are generally represented af- 
firmatively, but by convention, we have 
deleted the affirmative polarity marker. 
'The presence or absence of this marker 
in the picture representation has no con- 
sequences for any of our proposals, so long 
as it is assumed that the absence of a 
marker denotes affirmative polarity. The 
picture representation may be influenced 
by the sentence that precedes it. In par- 
ticular, if the picture has several possible 
interpretations, it is coded so that it has 
the same argument as the sentence repre- 
sentation (Carpenter & Just, 1972; Car- 
penter & Just, in press; Just & Carpenter, 
1971). 

The detailed form of the representation 
of various kinds of sentences has not yet 
been empirically verified. For example, 
the research on the linguistic factors that 
determine the psychological predicate- 
argument structure has only begun (cf. 
Halliday, 1967 ; Hornby, 1972). Moreover, 
there are cases where various representa- 
tions are formally equivalent, and the 
selection of one particular form is some- 
what arbitrary. Nevertheless, our Con- 
ventional notation is sufficient for the 
current model and promises to be flexible 
enough to accommodate a variety of lin- 
guistic structures (cf. Kintsch, 1972). 
^ The same sentence may be represented 
differently in different situations. ‘This 
follows from the assumption that the rep- 
resentation contains the information ise 
a person extracts from a sentence. What 
ion is extracted depends on the 
nces, the situation in which 
d, and the listener's 


informat 
receding sente 


the sentence is embedde 
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Increment 
n 


Counter 
+1 


Set Response Index to True 
Represent Sentence 
Represent Picture 


Set the Constituent 
Counter 


Find and Compare 
the n'h Constituents 
Do they match? 


Constituents 
been Compared? 


No 


Tag Mismatch 
Change Index 


FiGURE 1. 


previous knowledge. In other words, con- 
text plays a role in how a sentence is 
represented, suggesting an important dis- 
tinction between the psychological notion 
of an 


internal representation and the 
traditional linguistic notion of deep 
structure. 


The mental operations. The main focus 
of the model is on the operations that 
compare the sentence and picture repre- 
sentations. The model postulates that the 
Corresponding constituents from the two 
Tepresentations are retrieved 
pared, pair by pair. 
of these retrieve-an 
is assumed to be th 
of the pattern of 


and com- 
Moreover, the number 
d-compare operations 
€ primary determinant 
verification latencies, 
Figure 1 shows the proposed process in 
flowchart form. The representations' 
propositional structure and embeddings 
Provide an ordering relation on the con- 
Stituents. This ordering determines the 
sequence in which constituents 
pared. Inner propositions are 
before polarity markers, 
in a sentence representation is assumed to 
match the absence of a marker in the 
picture representation, since pictures are 
generally encoded affirmatively, The 
"find-and-compare" process is a serial, 
iterative operation that can be applied to 
representations with multiple embeddings, 


are com- 
compared 
Án AFF marker 


Execute Index| 


"The constituent comparison model. 


This iterative operation will allow the 


model to be generalized without additional 
assumptions. 


A central assumption is that whenever 
two corresponding constituents from the 
Sentence and picture representation mis- 
match, then the entire comparison process 
is reinitialized. 
from looping forever on mismatching con- 
Stituents, we assume that the first time 
a mismatch is discovered, the two con- 
stituents involved are tagged, so that on 
subsequent recomparisons the two will be 
treated as a match. 

Since mismatches cause the comparison 
process to be reinitialized, the total number 
of comparison operations, and consequently 
the total latency, increases with the num- 
ber of mismatches. Moreover, a mismatch 
that occurs later in the comparison process 
results in more recomparisons than a mis- 
match on earlier constituents. So the 
total latency is a function of both the 
number of mismatches and their locus in 
their respective represent. 

A response index moni 
and mismatches betw 
index has two 


ations, 

tors the matches 
een constituents. The 
possible States, ¿rue and 
false, At the beginning of each trial, its 
initial State is true, but each mismatch 
causes 1t to change from its current state 
to its other State. The time spent in 


To prevent the process. 


a 
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changing the response index (and for that 
matter, tagging mismatching constituents) 
is assumed to be negligible relative to the 
time to perform the find-and-compare 
operation. 

When the model is applied to the pro- 
posed sentence and picture representations 
in the Just and Carpenter (1971) experi- 
ment, it can account for the latencies in 
the [our conditions, as shown in Table 1. 
In Table 1, a match between two con- 
stituents is designated by a plus (+) under 
them, and a mismatch by a minus (—). 
In this notation, the comparisons proceed 
from right to left until a mismatch is 
encountered. A reinitialization is denoted 
by a new line of pluses and minuses. In 
the simplest case, the true affirmative, 
there are no mismatches between the sen- 
tence and picture representations. The 
first comparison, between the inner propo- 
sitions, results in a match. The second 
comparison, between polarity markers, also 
results in a match. (The Arr marker in 
the sentence representation is presumed 
to match the absence of any polarity 
marker in the picture representation). 
Thus after a total of two constituent com- 


parisons, the truth index is still set to true, 
and this response is executed. The number 
of constituent comparisons in. the true 
affirmative case serves as the baseline for 
the other conditions and will be referred 
toas k. Here, k equals 2. 

l In the false affirmative condition, the 
inner propositions of the sentence and 
picture mismatch. The mismatch will re- 
initialize the comparison process, causing 
one extra comparison above the base 
number. Table 2 shows the consequences 
of this mismatch in detail. The mis- 
matching constituents are marked with a 
tag (denoted in Table 2 with an M), and 
the response index is set to false. After 
reinitialization, the tagged inner constit- 
uents are compared, and they match. 
The next comparison, between the polarity 
markers, also results in a match. So, the 
response false is executed after a total of 
k + 1 constituent comparisons. 

In the false negative condition, there 
will be a total of & + 2 comparisons, due 
to the mismatch on the second constituent, 
the polarity markers. This mismatch will 
cause the response false to be executed. 
For the true negative condition, both the 


TABLE 1 


REPRESENTATI 


ONS AND PREDICTIONS FOR THE FOUR INFORMATION CONDITIONS 


Stimulus and representation 


True affirmative 


| False affirmative 


EE addi 
The dots are red. The dots are red. 
Sentence The | Drm 
Picture Red dots | Black a - 
Sentence representation (AFF, M in (arr, rer wee 
i ; RED, | LA i 
Picture representation " ja i index — false 
=r is 


| 
| 
| 
| k comparisons 
| 
| 


Sentence 
Picture 


Red dots 


response = true 


a 
“The dots aren t red. 


| [NEG, (RED, pors) ] 


response = falsc 
| k + 1 comparisons 


True negative 


negative | a 


"The dots aren't red. 
| Black dots 
[xEG, (RED, pors)] 
(BLACK, DOTS) 


Sentence r ; 
Pigtuce ee sri m | (RED, DOTS) (alse | E index 2 false 
REOR = zu judex e^ a 4 index =true 
* d | + s 
response — fals response = true 
k+2 comparisons k43 comparisons 


gns denote matches and mis 
tialization of the compari 


mismatches of the corre 
son process- 


ch horizontal line 


of plus and 


€— 


sponding constituents. ™ 
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TABLE 2 


A TRACE OF THE OPERATIONS IN VERIFYING A FALSE 
AFFIRMATIVE 


Stimulus 


The dots are red. 
A set of black dots 


Sentence 
Picture 


Operations Trace 


Initialize response index to 
true | 

Represent sentence 

Represent picture 

Compare first constituents — 
Tag sentence constituent | [arr, ( M )] 
Tag picture constituent € St 3} 
Change index to false 
Reinitialize comparison 


(AFF, (RED, Dots) ] 
(BLACK, LOTS) 


process 
Compare first constituents + 
Compare second constituents | -+ 
Respond with content of 
index False 


Number of comparisons |k + 1, wherek = 2 


Note. Abbreviation: M denotes a neutral tag. 


first and the second constituents mismatch 
so the response /rue will be executed after 
a total of k + 3 constituent comparisons. 

The model postulates that verification 
latencies should be a direct function of 
the number of constituent comparisons. 


Slope = 215 msec/Comparison 
1800} 
ç 
o H 
o 
E 
> 1400 F 
c 
£z F 
o 
+ 4000+ 
— fi  —— —Ü 
K K+1 K+2 K+3 
(TA) (FA) (FN) (TN) 


NUMBER OF CONSTITUENT COMPARISONS 


FIGURE 2. The fit of the constituent comparison 
model to the data from Just and Carpenter (1971). 
(Abbreviations: TA = true affirmative, FA 
affirmative, FN = false negative, T 
negative.) 


The number of comparisons, and hence the 
latency, should increase linearly from true 
affirmatives (2) to false affirmatives (k + 1) 
to false negatives (k + 2) to true nega- 
tives (k + 3). 

The verification latencies for The dots 
are(n’t) red, as well as the best fitting 
straight line, are shown in Figure 2 (data 
from Just & Carpenter, 1971, Experi- 
ment 2). The predictions of the model fit 
the data quite well. The model accounts 
for 98.0% of the variance among the four 
means. The slope is 215 msec per con- 
stituent comparison. 

Only one parameter, the time to find 
and compare a pair of constituents, is 
necessary to describe the mental opera- 
tions that are involved in these four con- 
ditions.! Thus, the current model is a 
very parsimonious explanation of these 
latencies. Although parsimony is desira- 
ble, the model should be evaluated on the 
basis of its ability to offer a rigorous 
formulation that both is a plausible mental 
process and can incorporate a wide variety 
of empirical results. This ability will be 
demonstrated in each of the following 
sections. The nature of the propositional 
representation, particularly the embedding 
feature, should allow the model to be 
generalized to more complex sentences 
without additional modifications. This 
property of the model was tested by 
examining embedded affirmative and nega- 
tive sentences. 


Tests of the Model 


Experiment 1 


We devised a more stringent test of the 
model by systematically varying the scope 
of the negative—the range of constituents 
to which the negative applies (Jackendoff, 


_ | OF course, a second parameter for the intercept 
18 necessary to describe a straight line. ‘The inter- 
cept time is a wastebasket parameter containing 
the durations of all of the processes which are 
common to all the conditions. In this spirit, we 
will refer to the “one” parameter of our model, 
namely the slope, when there are really two. 
Similarly, we will refer to the “two” patate 
of the Clark and Chase (1972) and Trabasse et al. 
(1971) model when there are really three- 


E 


) 
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1969; Klima, 1964; Weinreich, 1963). We 
varied the scope by using affirmative sen- 
tences with the superordinate clause, J? is 
true that. . ., for example, Iż is true that 
the dots are red. Such an affirmative can 
be negated in two ways. With one type 
of negation, the negative has a small scope, 
namely the inner predication: /£ ts true 
that the dols aren't red. This will be called 
predicate negation. The second type of 
negation has a large scope since the nega- 
tive is in the superordinate clause where 
it negates the entire inner proposition: It 
isn't true that the dots are rez This type of 
negation will be called denial. These two 
kinds of negation may be represented dif- 
ferently, and they should have different 
verification latencies. The precise predic- 
tions can be derived by examining how 
the comparison process might operate on 
the hypothesized representations for these 
types of sentences.” 

The representation of an affirmative 
sentence like It’s true that the dots are red 
may be the same as for the simple sen- 
tence The dots are red, namely, LAFF, (RED, 
pots)]. The rationale is that the em- 
bedding clause //'s true . . . does not change 
the truth value and so it can be ignored. 
To demonstrate this point, consider a con- 
catenation of this type of clause, for 
example, Jis true that it’s true that it’s 
true . . . the dots are red. The number of 
such embedding clauses is irrelevant to the 
truth value of the proposition. Similarly, 
the embedding affirmative clause may be 
deleted from the representation of a predi- 
cate negative sentence like It's true that 
the dots aren't red, so that the representa- 
tion would be [NEG, (RED; poTs)].. How- 
ever, the representation of a denial like 
Tt isn’t true that the dots are red must 
include the negative embedding clause. 
Here the embedding clause does affect the 
truth value of the sentence. Thus, denial 
sentences might be represented (xEG, LAFF, 
(RED, Dots) ]}. The pictures. would be 
represented as simple propositions C 
(RED, DOTS) or (BLACK, pots). Table 


BE eic 
2 These two types of negatives have the mm 
truth value here, although this is not always the 


case (Fillmore, 1963; Lakoff, 1970). 


shows examples of the representations in 
the six conditions. 

The predictions of the model can be 
derived by examining the flowchart in 
Figure 1 and the representations in Ta- 
ble 3. Table 3 details the process, showing 
the comparisons and outcomes for each of 
the six information conditions. The pre- 
dicted number of operations necessary to 
verify a true affirmative is k; a false 
affirmative is k+ 1; a false predicate 
negative is k + 2; a true predicate nega- 
tive is k + 3; a false denial is & + 4; and 
a true denial is k +5. The verification 
latencies should increase linearly with the 
proposed number of operations. If this 
result is obtained, it will constitute strong 
support for the constituent comparison 
model and the notion of a single under- 
lying iterative operation. 


Method. The experiment was a verification task 
in which the subject was timed while he read a 
sentence, looked at a picture, and then decided 
whether the sentence was true or false of the 
picture. 

"The stimulus sentences were the three in Table 3, 
as well as six more in which the adjectives black 
or green replaced the adjective red. The picture 
was an array of 16 dots of one color, either red, 
green, or black. Table 3 shows the truth values 
for the various combinations of sentences and pic- 
tures. Each affirmative sentence could be false 
with respect to two pictures and true with respect 
to one picture; consequently, the one true sen- 
tence-picture combination was presented twice. 
The sentence-picture combinations for the false 
negative conditions were presented twice for the 
same reason. Thus, there was a total of 36 sen- 
tence-picture combinations. Each sentence was 
typed in elite type and subtended 10? of visual 
angle. The 4 X 4 array of dots, which was drawn 
below the sentence with a felt tip pen, subtended 
2.5? X 2.5? of visual angle. The stimulus card 
was viewed in a tachistoscope at a distance of 
58.4 cm. 

'The subject initiated a trial by pushing a switch, 
and 500 msec later the stimulus card was presented 
and remained in view until the subject responded. 
The subject's decision was made using a two- 
button decision apparatus. The assignment of 
dominant hand to true button was balanced across 
subjects. The subject was given feedback about 
the correctness of his answer only during the 
practice session which consisted of 10 trials selected 
at random from the ensemble of 36. After the 
all 36 cards were shuffled and the subject 
The testing 


practice, c 
went through 7 blocks of 36 trials. 
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TABLE 3 
REPRESENTATIONS AND PREDICTIONS FOR THE SIX [NFORMATION CONDITIONS 
Stimuli and representations True affirmative False affirmative 
Sentence It's true that the dots are red. | It's true that the dots are red. 
Picture Red dots | Black dots 


Sentence representation 
Picture representation 


+ + 


[AFF, (RED, DOTS)] 
(RED, DOTS) 


[ArrF, (RED, DOTS) ] 
(BLACK, DOTS) 
— index = false 


response — true | # sls 


| k comparisons 


Picture Red dots 


False predicate negative 


t 
| response = false 

k + 1 comparisons 
| 


True predicate negative 


Black dots 


Sentence | It's true that the dots aren't red. | It's true that the dots aren't red. 
| F 


Sentence representation (NEG, (RED, Dors)] (NEG, (RED, Dots) ] 

Picture representation (RED, DOTS) | (BLACK, DOTS) 
| — + index false — index false 
Ms d E af: index = true 
| response = false + + 


k + 2 comparisons 


response = true 
k + 3 comparisons 


False denial 


True denial 


Sentence 

Picture 

Sentence representation 
Picture representation (R 


| Red dots 


| It isn't true that the dots are red. 


| [NEG, [Arr, (RED, pors) | 


It isn't true that the dots are red. 
Black dots 
{NEG, [AFF, (RED, pots) }} 


RED, DOTS) (BLACK, DOTS) 


— + + index = false | — — index =false 
+ + + }; = + + index = true 
response = se + + 4 


k + 4 compari 


T igns denote matches and mismatches 
minus signs indicates a reinitialization of the comparison proce: 


session lasted about 50 minutes, The subjects 
were 24 students of Carnegie-Mellon University 
who were paid for their services. 


Results. The latencies for erroneous 
responses (3.2%) were discarded, and the 
subject's score for each condition was the 
mean of his correct latencies for that con- 
dition. As predicted, the mean latencies 
increased linearly with the number of 
hypothesized constituent comparisons, 
More precisely, latencies increased an 
average of 200 msec [or each additional 
constituent comparison (standard error 
= 23 msec). Figure 3 shows this result 
along with the best fitting straight line: 


The model accounts for 97.7%, of the vari- 


sons response = true 
k + 5 comparisons 


of the corresponding constituents, Each horizontal line of plus and 
SS. 


ance among the six means, 
= 17117, $ «& 01. The residual 2.3% is 
not significant, F (4, 115) = 1.01. The 
root mean squared deviation of 52 msec 
1s small relative to the 200 msec parameter. 
This analysis confirms the major hypoth- 
esis that verification time increases linearly 
with the number of constituent comparisons 
The error rates for the six condition? 
Were correlated with the latencies qe xe 
as is shown in Figure 3, This correlati?" 
indicates that the probability of erro zed 
creases with the number of hypothe 
Operations, 
Discussion, 
processing time 


J 


as i 
The model pr^ titions on 
for these six c9" 


F (1, 115). 
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the basis of a single parameter: the time 
to find and compare a pair of constituents. 
This strongly supports the hypothesis that 
a single iterative operation underlies the 
processing of affirmative and negative sen- 
tences. These data also support the hy- 
pothesis that the two negatives are repre- 
sented differently and that the essence of 
this difference lies in the locus of the 
negative in the internal representation. 

A further control study showed that the 
representation and processing of the sen- 
tence is determined by its semantics rather 
than by its surface structure. To show 
this, we compared the processing of sen- 
tences that had the same constituent 
structure but different surface structures. 
In this control study, the inner proposi- 
tions of the sentences were embedded in 
two ways: the same way as in Experi- 
ment 1 (eg. It's true that the dots aren't 
red) and with the embedding clause at 
the end of the sentence (e.g., That the 
dots aren't red is true). Both of these 
kinds of sentences are postulated to have 
the same underlying constituent structure. 
However, the position of the negative in 
the surface structure has been changed. 
If the results of the basic experiment can 
be replicated with the new sentences, then 
the results must be due to the postulated 
rather than the surface 


new stimulus sentences 
were: Affirmative—That the dots are red 
is true; predicate negative—That the dots 
aren't red is true; and denial—That the 
dots are red isn't true. Thus, there were 
three basic sentence types, two kinds of 
surface structures and the true-false vari- 
able. In addition, each sentence had the 
adjective either red or black, for a total 
of 24 stimuli. The pictures were 4X4 
arrays of all red or all black dots. The 
12 paid subjects were run for 2 blocks of 
24 trials with a procedure identical to that 
of Experiment 1. 

This control study showed that the 
types of surface structures were proces 
similarly. Both types of sentences showed 
a linear increase in latency as the number 
of comparisons increased. Regardless of 
whether the negative morpheme occurred 


representation 
structure. The 
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"—2 
SENTENCE AND PICTURE 
SIMULTANEOUS 
Slope = 200 Msec/Comparison 
I 
2800 2 
3 
22400 
> 
E 
$2000 
E] 


1600 


K K+H Kt2 K«3 K+4 K+5 
(TA) (FA) (FP) (TP) (FD) (TO) 


NUMBER OF CONSTITUENT COMPARISONS 


Ficure 3. The fit of the constituent comparison 
model to the results of Experiment 1. (Abbrevia- 
tions: TA = true affirmative, FA = false affirma- 
tive, FP = false predicate negative, TP = true 
predicate negative, FD = false denial, TD = true 
denial.) 


near the beginning or end of the sentence, 
denials took about 500 msec longer to 
verify than predicate negatives. This 
result shows that the underlying constit- 
uent structure, rather than order of nega- 
tives in the surface structure, determines 
processing time and constitutes further 
support for the proposed representations 
for the two kinds of negative sentences. 
The constituent comparison model cor- 
rectly predicts that the pattern of latencies 
in this control experiment should be the 
same as in Experiment 1. Whether the 
embedding string was at the beginning or 
end of the sentence, there was a linear 
increase in latency with the number of 
hypothesized constituent comparisons. 
The best fitting straight line for sentences 
that were the same as in Experiment 1 
had a slope of 234 msec/comparison, 
— 28 msec. The linear 
trend was significant, F (1, 121) = 55.24, 
p< 01. The model accounts for 97.4% 
of the variance among the six means. The 
residual was not significant, F (4, 121) < i 
The best fitting line for the sentences with 
the new serial order had a slope of 286 


msec/comparison, standard error = 40 


standard error 
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msec. The linear trend was significant, 
F (1, 121) = 82.77, p < .01. The model 
accounts for 95.5% of the variance among 
the six means. The residual for this new 
serial order was also not significant, 
F (4, 121) € 1. Finally, the two slopes 
were not significantly different from each 
other, F (1, 121) — 1.38. Thus, the serial 
position of the negative in the suríace 
structure of the sentence does not alter 
the underlying linearity of the results. 

This control study also rules out an 
alternative interpretation of Experiment 1. 
In Experiment 1, the scope of the negative 
morpheme is correlated with its serial 
position in the surface Structure of the 
sentence. The negative in a denial sen- 
tence occurs near the beginning of the 
sentence, while the negative in a predicate 
negative occurs at the end. Thus, the 
increase in verification latencies was cor- 
related with the serial position of the 
negative morpheme. One might argue that 
negatives at the beginning of a sentence 
impose a greater memory load, and this 
greater memory load slows down subse- 
quent comparison processes. This argu- 
ment is countered by the result that the 
serial position of the negative is not re- 
sponsible for the difference in negation time. 


Experiment 2 


Because of its focus on comparison pro- 
cesses, the model ignores the Possibility 
that some sentences take longer to read 
than others. For example, it may take 
longer to read and represent a negative 
sentence than an affirmative one. Both 
the reading and representation time, as 
well as the comparison time, were included 
in the verification latencies in Experi- 
ment 1. Experiment 2 was designed to 
test whether latencies still increased line- 
arly when reading and representation time 
were eliminated from the verification la- 
tencies. Subjects were given 2 sec to fead 
and represent each sentence before the 
picture. appeared. Verification time was 
measured only from the onset of the sila 
ture. If all reading and representation 
processes are complete after 2 sec, then 
any remaining differences in the sub- 
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sequent verification time can safely be at- 
tributed to the comparison process. It was 
decided that 2 sec was sufficient reading 
time on the basis of a small study where 
two naive subjects were timed as they 
read a sentence and indicated they had 
apprehended it by closing a microswitch. 


Method. ‘The materials and procedure were the 
ne as in Experiment 1, except for the following 
difference. After the subject pushed the switch 
to initiate a trial, the sentence appeared in the 
annel of the tachistoscope for 2 sec and 
then disappeared the picture appeared in the 
bottom channel. The subject was timed from the 
onset of the picture until he made a response. 
Thus, the response latencies should not include 
the time to read and represent the sentence. The 
subjects were 24 students at Carnegie-Mellon Uni- 
versity who were paid for their services. 


Results and discussion. The results were 
first analyzed. for only the first block of 
trials, since previous pilot studies of such 
sentence-first conditions indicated a sys- 
tematic change in the pattern of latencies 
across blocks of trials. The results for 
the later blocks and the justification for 
the partitioning of the d 
cussed in detail below, 

The subject’s score for each condition 
Was the mean of his correct latencies for 
that condition. As predicted, the mean 
latencies increased linearly with the number 
of hypothesized constituent comparisons. 
Figure 4 shows this result, along with the 
best fitting straight line, The verification 
time increases an average of 169 msec for 
each additional constituent comparison 
(standard error = 21 msec). The model 
accounts for 97.9% of the Variance among 
the six means, P (1, 115) = 136.54, p 
< .01. The residual 2.1% is not signifi- 
cant, P (4, 115) < 1. The root mean 
Squared deviation of 42 msec is small 
relative to the 169 msec parameter. The 
error rate was low (6.5%). As in Experi- 
ment 1, the error rates for the six condi- 
tions Were correlated (p = 81) with the 
latencies, 
The d 


ata will be dis- 


tion in 
time per 
The estima in Experi- 
31 msec more than ! 
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ment 2, where reading and representation 
time were eliminated. A / test comparing 
the slopes of the subjects in the two ex- 
periments showed no significant difference, 
t (46) <1. Since the 200 msec slope in- 
volves seven times as much practice as 
the 169 msec slope, this comparison should 
be viewed-with caution. In any case, it 
is clear that verification time increases 
linearly with the number of hypothesized 
constituent comparisons, regardless of 
whether reading and representation time 
is included. 


The Control Structure of the Comparison 
Process 


The processes that operate on the sen- 
tence and picture representations must 
have some control structure that governs 
the flow of activity in short-term memory 
(Atkinson & Shiffrin, 1968; Newell, 1972, 
1973). We have written a computer simu- 
lation of the constituent comparison model 
and integrated it with a more global theory 
of the control structure of short-term 
memory, developed as a production system 
by Newell (1973). One purpose of the 
simulation was to make the control struc- 
ture of the verification process explicit. 
A production system specifies exactly what 
conditions of short-term memory will trig- 
ger a particular operation and how that 
operation will affect the content of the 
memory. 

A production system has two main com- 
(a) a short-term memory con- 
sisting of n ordered items, and (b) a list 
of independent production rules that oper- 
ate on the items, Each rule consists ol 
conditions and actions. Whenever the 
items in short-term memory satisfy the 
conditions of a particular rule, then the 
actions associated with that rule are per- 
formed. An action may change the con- 
tents of short-term memory by rearrang~ 
ing, adding, deleting, or modifying 
individual items. When the memory 19 
filled to capacity, insertion of a new item 
displacement or forgetting of the 
After an action is taken, the 
y may satisfy the 
One rule 


ponents: 


causes 
oldest item. 


contents of the memor 
Same rule or à different one. 
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Ficure 4. The fit of the constituent comparison 
model to the results of the first block of Experi- 
ment 2. (Abbreviations: TA = true affirmative, 
FA = false affirmative, FP = false predicate nega- 
tive, TP = true predicate negative, FD = false 
denial, TD = true denial.) 


after another is executed until no rule is 
satisfied or until a "stop" rule is executed. 

At the beginning of the verification 
process, there are three items in the 
memory: a response index that is set to 
"true," a sentence representation, and a 
picture representation. Since these three 
items are essential for performing the task, 
they are repeatedly rehearsed, preventing 
them from being forgotten. There are five 
other items that play a role in the verifi- 
cation process. First of all, when two 
corresponding constituents are to be com- 
pared, a copy of each is retrieved from 
the sentence and picture representations. 
These copies constitute two additional 
items in the memory. Next, there is a 
place holder for each of the two repre- 
sentations that keeps track of which items 
were last retrieved and compared. Finally, 
another item, called "mismatch," is gen- 
erated when two corresponding constit- 
uents mismatch. Thus, there are eight 
possible items involved in performing the 
task, and the memory is postulated to have 
a capacity of seven items. 

Although the flowchart model and the 


production system both account for per- 
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formance in the same task, there are two 
respects in which the models differ. One 
difference concerns the reason for reini- 
tializing the comparison process aíter a 
mismatch. The flowchart explanation 
treats the comparison process as one sub- 
routine and tagging and changing the re- 
sponse index as a separate subroutine. 
Once the comparison subroutine is inter- 
rupted and control is transferred to the 
tagging subroutine, the comparison sub- 
routine must start from the beginning. 
In the production system, the reinitializa- 
tion results from the limitations of short- 
term memory. In short, there is a rule 
whose condition is a mismatch between 
two corresponding constituents. When the 
condition is satisfied, the action associated 
with the rule is performed, namely, the 
insertion of a “mismatch” item into mem- 
ory. The presence of this “mismatch” 
item is the condition of a rule that updates 
the value of the response index. The 
“mismatch” item displaces a place holder 
that indicates which constituents were last 
compared. The absence of the place holder 
is eventually the condition of a rule that 
says, roughly, if there is no place holder, 
start the comparison process with the first 
constituent. This latter rule serves two 
purposes. It initializes the verification 
process at the beginning of a trial, and it 
reinitializes the process after a mismatch 
has displaced a place holder. In summary, 
a mismatch generates intervening items 
and operations, like updating the response 
index and tagging constituents. This 
makes the comparison process "forget" 
which constituents were last compared, 
and it consequently restarts from the 
beginning. 

A second dillerence between the two 
models is in the nature of the "find-and- 
compare" operation. The flowchart model 
characterizes it as a unitary process. By 
contrast, the production system reveals 
that there are a number of suboperations 
in the find-and-compare operation, 
which can be classified as “find” 
one of which is "compare." 
tinction between “find” and 
may be clarified by 


most of 
and only 
This dis- 
“compare” 
considering how we 
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mentally multiply two three-digit numbers 
(e.g., 245 X 314 = ?). In this task, most 
of the processing time seems to be spent 
in retrieving digits from various positions 
and keeping track of them; relatively little 
time seems to be spent in actually multi- 
plying or adding pairs of digits (Danse- 
reau, 1969). Similarly, most of the pro- 
cessing time in sentence verification may 
be consumed by retrieving and positioning 
the appropriate constituents and keeping 
track of one's place; relatively little time 
may be spent in actually comparing Items. 
If retrieval is a primary determinant of 
the total latency, this would explain why 
the time per find-and-compare operation 
is the same for all constituents. It is 
plausible that the time to retrieve a con- 
stituent may be independent of its type 
or complexity. 

This retrieval plus comparison operation 
resembles the '"scan-and-compare'" opera- 
tion in Sternberg's (1969a) short-term 
memory scanning model for the context 
recall task. At present, there is no way 
of empirically partitioning the slope into 
retrieval and comparison components either 
in memory-scanning tasks or in the sen- 
tence comprehension task. 


A Test of the Model Against Previous 
Research 


Since Wason's (1959) first work on veri- 
fication, a fair number of studies have 
examined latencies in processing affirmative 
and negative sentences. These studies can 
now be reanalyzed in terms of the con- 
sutuent comparison model. But before 
the model can be applied, the rules for 
representing negative sentences must be 
elaborated. Since these previous studies 
did not include explicit embedding strings 
like M isn't true. . ++ Negatives could have 
been represented with either a small scope 
of negation (as predicate negatives) or a 
large scope (as denials) (cf. Weinreic?; 
1963, D. 127). We have used the following 
decision. rule, derived directly from bs 
model, to determine the representation: 
If the ratio of negation time to falsifica- 
tion time was 2:1, then the negatives No 
represented with a small scope; but ae 
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ratio was 4:1, then the negatives were 
represented with a large scope. The rule 
can be derived by considering the relation 
of falsification time to negation time for 
the two scopes. Falsification time, due to 
a mismatch on the inner proposition, re- 
sults from one additional constituent com- 
parison. Negation time, with a small 
scope, is the result of two additional con- 
stituent comparisons, so the ratio of nega- 
tion time to falsification time is 2:1. 
Negation time, with a large scope, is the 
result of four additional constituent com- 
parisons, so the ratio is 4:1, and hence, 
the decision rule for previous studies. 
When these representation rules were 
applied to previous verification studies, a 
surprising regularity emerged. In most 
studies where the sentence preceded the 
picture, the ratio of negation time to 
falsification time was 4:1, indicating that 
the negative was represented with a large 
scope. By contrast, in studies where the 
sentence followed the picture, the ratio 
was 2:1, indicating that the negative had 
a small scope. This regularity was par- 
ticularly convincing in two studies that 
systematically examined the effect of re- 
versing the sentence-picture order. In the 
Clark and Chase (1972) experiments, sub- 
jects were given sentences like The star 
isn't above the plus and pictures like G2 
or (t). In the sentence-first condition, 
the ratio was 3.8:1, while it was 2.1:1 in 
the picture-first condition. In the second 
pair of experiments, Just and Carpenter 
(1971) presented a sentence like The dots 
aren't red followed or preceded by a picture 
of red dots or black dots. In the sentence- 
first condition the ratio was 4.2:1, while 
in the picture-first condition it was 17:1. 
These studies indicate that the same nega- 
tive sentences were represented and proc- 
essed with a different scope of negation, 
depending on whether they followed or 


preceded the picture. 
Eoo 

3 We have analyzed the cases w 
tion for how to code the picture, “code what's on 
the top" or "code what's on the bottom,' agreed 
with the sentence preposition (above and below, 
respectively). In some cases when they do E^ 
agree, there is recoding, an operation to be dis- 


cussed later. 


here the instruc- 


When the model's predictions and the 
actual data are laid out side by side, the 
fit is very convincing. As Table 4 shows, 
the model accounts very well for the com- 
prehension of affirmative and negative sen- 
tences in a large number of previous 
studies. 

The time to retrieve and compare a pair 
of constituents seems consistent within any 
one set of materials and experimental con- 
ditions, such as our own, or Clark and 
Chase’s (1972) or Gough's (1965, 1966). 
However, there is some variability across 
various experiments and conditions. These 
data from the various experiments are 
based on different numbers of observa- 
tions, and different sentences, pictures, etc. ; 
so direct comparisons are difficult to make. 
However, it is plausible that the com- 
plexity of the linguistic or perceptual 
stimuli may affect the duration of a com- 
parison. One factor that clearly does 
affect this duration is practice. 

Practice effects. Extended practice in 
the sentence-verification task naturally 
makes performance much faster (Singer, 
Chase, Young, & Clark, Note 1) What 
is much more important, from our point 
of view, is that even though the time per 
operation decreases with practice, the line- 
arity among the various information con- 
ditions is still preserved. In the Singer 
et al. experiment, three patient subjects 
verified sentences like The star isn’t above 
for 15 days! The sentence preceded the 
picture, and timing started only from the 
onset of the picture. However, the pictures 
were of various types to discourage any 
sentence recoding prior to the picture. 
For example, the sentence The star isn't 
above could be true of the picture like T 
or one like $, where no star was present. 
Since these sentences are similar to those 
used by Clark and Chase (1972), and since 
it was a sentence-first experiment, it was 
assumed that the negative was represented 
with a large scope. The model predicts 
true affirmatives should require k opera- 
tions, false affirmatives, & + 1; false nega- 
tives, k + 4; and true negatives, k + 5. 
In each 5-day period, latencies increased 
linearly with the number of constituent 
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60 PATRICIA A. CARPENTER 
comparisons. Although the slope decreased 
from 58 msec to 38 msec to 29 msec over 
the 15 days, the linearity persisted with 
practice. The model accounts for 97.7%, 
97.4%, and 99.6% of the variance, respec- 
tively, as shown in Table 5. 

In a similar experiment, Young and 
Chase (Note 2) collected verification laten- 
cies on 2 days, Day 3 and Day 6, sepa- 
rated by 2 days of practice on a related 
task. On both Days 3 and 6, latencies 
increased linearly with the number of con- 
stituent comparisons; at the same time, 
the slope for Day 6 was much lower than 
for Day 3, as shown in Table 5. We have 
no model of practice effects—why people 
get faster with extended practice. How- 
ever, extended practice does seem to affect 
the duration of a find-and-compare opera- 
tion similarly in all four information con- 
ditions. Thus, these data provide further 
support for a single underlying operation, 
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as specified by the constituent comparison 
model. 


Chronometric Models of Sentence Verification 


The constituent comparison model shares 
a family resemblance with a previous model 
proposed by Chase and Clark and Tra- 
basso, Rollins, and Shaughnessy, hereafter 
the cACTRAS model (Chase & Clark, 1972; 
Clark & Chase, 1972; Trabasso et al., 
1971). However, the resemblance ends at 
the comparison stage. The CACTRAS model 
posits a different comparison operation 
for each kind of constituent, rather than 
a single iterative operation. For example, 
to process a sentence representation with 
two constituents, CACTRAS would posit one 
operation that compares the inner propo- 
sitions and a separate operation that com- 
pares the polarity markers. A mismatch 
in either comparison operation consumes 


BLE 5 


OF THE CoNSTITUENT COMPARISON MODEL 


Practice effects 
Percentage i Observed (and estimated) latency 
Reference Mesinptechagiins- | Day | aun aaen 
for by (msec) TA FA FN TN 
model Q) [+] G4) | O H5) 
Singer, Chase, Young, | The star is(n't) 1-5 97.7 58 635 703 841 949 
and Clark, Note 1» above. (637) | (695) (869) (927) 
6-10) 974 38 495 523 621 693 
: (488) | (526) (640) (678) 
11-15} 99.6 29 459 488 580 598 
Young and Chase, The star is(n't) FO | SR in PN 
Note 2^ (Control above the plus, 
condition) 
Subject 1 3 99.4 135 843 | 943 | 1,400 | 1,486 
(831) | (966) (1,371) | (1,506) 
6 97.8 46 Ee eR 929 971 
To ) | (792) (930) (976) 
S ë 
ubject 2 3 98.6 154 (183) 900 1,457 1,515 
(937) | (1,399) | (1,553) 
6 99.6 65 713 T i 957 n 
T (709) | (774) | (969) | (1,034) 
ubjeci 3 97.5 100 es 871 | 1,243 in 
93) | (893) | (1,193) | (1,2 
6 96.5 32 | 657 | 714 | gia | 814 
(670) | (702) | (798) | (830 
Nole, Abbreviations: TA = true affirmative, FA 


d = false a iv i 
^ The number of constituent comparisons in the ti afirmative, EN 


1 T! 
€ are indebted to Bill Chase for providing the a cui 


graphs from which the data 


T = false i 
tive case is denoted 3 s d 


points were estimated, 


— true negative. 
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time, and in general, the mismatch times 
are different for each of the two operations. 
According to the cacrRAS model, mis- 
matches take longer than matches because 
the comparison itself takes longer and 
because changing the response index con- 
sumes time. Additionally, some of nega- 
tion time may be attributed to extra time 
spent in reading and encoding a negative. 
This account fails to explain why the 
predicate negatives and denials in Experi- 
ment 1 have different negation times. 
Both types of negatives involve reading 
and encoding the word not; both involve 
a mismatch between an AFF and a NEG; 
and both require a change in the response 
index. If these are the processes that 
account for mismatch time, the negation 
times should be similar in the two cases. 
Yet, negation time is twice as long for 
denials as for predicate negatives. By 
contrast, the relative durations of the two 
negation times are correctly predicted by 
the constituent comparison model. 
Although there are some difficulties with 
the CACTRAS explanation of mismatch time, 
the model can still be extended to account 
for the difference between predicate nega- 
tives and denials. The CACTRAS model 
would have to posit two different com- 
parison operations for the two kinds of 
negatives. Therefore, there would be a 
total of three different comparison opera- 
tions, one for each of the constituents: 
inner propositions, inner polarity markers, 
and outer polarity markers. The mismatch 
times for these three operations would be 
estimated as 254, 373, and 779 msec, re- 
spectively. These parameters account for 
98.3% of the variance among the six con- 
ditions. By contrast, the constituent com- 
parison model's one parameter, represent- 
ing the time (200 msec) to find and 
compare a pair of constituents, accounts 
for 97.7% of the variance. The two ad- 
ditional parameters in the CACTRAS model 
account for an additional but insignificant 
6% of the variance, F (1, 115) = 1.00. 
We can also compare the fit of the present 
model to the CACTRAS model for the eight 
experimental conditions of Clark and Chase 


listed in Table 4. Using two parameters 


for each condition, the cacrRAs model 
accounts for 99.8%, 97.6%, 99.8%, 98.6%, 
99.5%, 99.8%, 99.7%, and 99.3% of the 
variance in the eight conditions. The con- 
stituent comparison model accounts for 
99.870, 97.470, 99.7%, 97.7%, 99.2%, 
99.2%, 99.7%, and 98.1%, respectively, 
and with only one parameter. Thus, the 
more parsimonious constituent comparison 
model is almost as precise at estimating 
these latencies. 

The use of response latencies to infer 
durations of processing stages requires that 
certain assumptions be met. Since the 
CACTRAS model posited separate compari- 
son operations for each kind of constit- 
uent, it was necessary to demonstrate 
that à mismatch between the polarity 
markers consumes the same amount of 
time, regardless of whether the inner 
propositions match, and vice versa. In 
fact, the data showed that the additivity 
assumption was quite reasonable. This 
additivity can be reinterpreted in terms 
of the constituent comparison model's 
single iterative find-and-compare opera- 
tion. According to the current model, the 
two operations in the comparison stage of 
CACTRAS are actually multiple executions 
of the same operation, and so naturally, 
their durations are additive. 


THE CONSTITUENT COMPARISON MODEL 
APPLIED TO RECODING 


The introspective reports and the verifi- 
cation latencies from a number of previous 
studies indicate that there is another way 
to process negative sentences. The alte-r 
native strategy is to recode either the sen- 
tence or picture representation before com- 
paring the two. Recoding refers to the 
process of changing a representation from 
one form into some other equivalent form. 
For example, a negative sentence can 
sometimes be recoded into an affirmative 
(e.g. The door is not open can be recoded 
as The door is closed). The result is that 
the recoded sentence representation is af- 
firmative and matches the polarity of the 
picture representation. In this section, 
we will propose a recoding model based 
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Increment Counter 
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FIGURE 5. 


on the same processes as the main verifi- 
cation model. 

Our examination of previous recoding ex- 
periments indicated that recoding a con- 
stituent consumes about the same amount 
of extra time as a mismatch on that con- 
stituent. Therefore, recoding time may be 
Proportional to the number of find-and- 
compare operations necessary to recode the 
constituent, Using this asa starting point, 
we will attempt to account for the recoding 
Process in terms of the same find-and- 
compare operation that is central to the 
constituent comparison model. 

The recoding model (shown in Figure 5) 
is best explained by working through an 
example. In one recoding study, Young 
and Chase (Note 2) explicitly instructed 
subjects to recode negative sentences like 
The star isn't above the-blus into an affirm- 
ative form with the Same preposition, 
like The plus is above the star. The sub. 
jects were timed while they read the sen- 
tence, recoded it if it Was negative, ex. 
amined a picture like Lor t, and responded 
true or false. Three highly practiced sub- 
jects participated in the experiment. 

The negative sentences are represented 
as denials (as shown in Table 6) because 
the ratio of negation time to falsification 


time was 4:1 in the nonrecoding condi. 
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Does it Match (Neg) or 
(Predicate)? 
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been Recoded?. 


Begin the Constituent 
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Yes 


Substitution 


Yes 


tion. After the sentence is initially rep- 
resented, it is tested for the presence of 
a negation marker. If the negation marker 
is absent, as it is for affirmatives, then 
Processing continues as specified by the 
constituent comparison model. The pres- 
ence of a negation marker triggers the 
recoding process, which consists primarily 
of finding the constituents to be recoded. 
Each successive constituent is retrieved 
and tested to determine whether it is to 
be recoded. If it is, the appropriate alter- 
native is substituted, The actual sub- 
stitution is assumed to take a negligible 
amount of time. If the constituent is not 
one of those to be recoded, then the next 
constituent is retrieved and examined. 
This recoding process terminates when all 
the to-be-recoded constituents have been 
replaced. The duration of the recoding 
Process is therefore proportional to the 
locus of the last constituent to be recoded. 
After recoding is completed, the recoded 
Sentence representation is compared to the 
Picture representation as specified by the 
constituent Comparison model. Since the 
recoded Tepresentation is now affirmative, 
the true condition will entail fewer opera- 
tions than the false condition. The total 
latency is Proportional to the sum of 
recoding plus Comparison operations. The 
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TABLE 6 


REPRESENTATIONS AND PREDICTIONS OF THE MODEL FOR THE YOUNG AND 
CHASE RECODING EXPERIMENT 


Stimulus and representation 


True affirmative 


False affirmative 


Picture ` 


Sentence 

Sentence representation 
Recoding operations 
Picture representation [AFF, (STAR, TOP)] 
Comparison operations EI sp 
response — true 

k operations 


[Arr, (STAR, TOP)] 


4 
The star is above the plus. 


(none—begin comparison process) 


+ 


* 
The star is above the plus. 
Carr, (STAR, Top) ] 
(none—begin comparison process) 
(arr, (PLus, Top) ] 
— index = false 

+ + 
response = false 
k + 1 operations 


True negative 


False negative 


Picture P 

Sentence The star isn't above the plus. 
Sentence representation [NEG, [aFF, (STAR, ror) ]] 
Recoding operations + + + 


Recoded sentence 
Picture representation (PLUS, TOP) 
Comparison operations + + + 
response = true 

k + 4 operations 


larr, [Arr, (PLUS, TOP) ]} 


* 


pl 
The star isn't above the plus. 
{neG, [AFF, (STAR, TOP) ]} 
dox + 
{arr, (AFF, (PLUS, TOP) ]} 
(STAR, TOP) 
- index = false 
ju d 
response — false 
k + 5 operations 


model predicts a total of & + 4 operations 
for the true negative, 3 operations for 
recoding, and k + 1 comparisons as shown 
in Table 6. For false negatives, it predicts 
a total of k + 5 operations, 3 for recoding, 
and k +2 for comparison. The predic- 
tions for affirmatives remain the same, 
since they are not recoded: true affirma- 
tives should take & operations; and false 
affirmatives, k + 1 operations. . 

The data for one subject are shown in 
Figure 6, along with the predictions of the 
model. The slope of the best fitting 
straight line is 70.1 msec/operation for 
above; 71.0 msec/operation for below. The 
model accounts for 99.8% and 98.4% of 
the variance in these two conditions. 
These data support the assumption that 
the extra time for recoding a particular 
constituent is equal to the time for proc- 
essing a mismatch on that constituent. 
The data for the other two subjects are 
given in Table 7. 

There are two differences between this 
Tecoding model and the main constituent 
Comparison model. First of all, the recod- 
mg process is self-terminating, since there 


is no need to retrieve further constituents 
once the appropriate alternatives have 
been substituted. A second difference is 
that the retrieval process is not reinitial- 
ized after each substitution operation. 
One possible reason for this latter dif- 
ference is that the response index is not 


"Below" 
1400F slope - 710 msec/Comparison 
fa 
ri 
$1200 
£l 
> 
H 
5 {000+ 
5 
E "Above" ; 
[ Slope = 70.1 msec/Comparison 
800 
i 1 1 L 1 1 
K Kt K+2 K+3 K*4 K+5 
(TA) (FA) (TN) (FN) 


NUMBER OF RECODING AND COMPARISON OPERATIONS 


E6. The fit of the constituent ur 
da bject who was instructe 
l to data from a sub) i 
auod negative sentences (data from Young & 
Chase Note 2). (Abbreviations: rA = true af- 
amate FA = false afirmative, TN = true nega» 
a D 


tive, FN = false negative.) 


FIGUR 
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TABLE 7 
SUMMARY OF THE PREDICTIONS AND FIT OF THE CONSTITUENT COMPARISON MopEL 
Percentage as 
of variance ime per " 
i i served (and estimated) latency 
7 Ex le of a stimu- accounted | comparison Observed (and 
FAEREREE E eder for by (msec) 
model 
Previous recoding experiments 
TA FA TN FN 
(k)> (k +1) (k + 4) (k + 5) 
Young and Chase, The sar is(n't) above 
Note 2b (Trans- the plus. , 4 dein 
formation 2) Subject 1 99.8 70 65 EA ds n 
i 3 814 1014 118 
Subject 2 96.6 83 kd IN (03 d) 
j 843 .057 i 
Subject 3 98.9 61 EA (55) (11038) (11099) 
Young and Chase; x se is(n't) below 
Note 2^ (Trans- the plus, ES 1.343 
i j 014 1,043 1,300 d 
formation 2) Subject 1 98.4 71 nort dues) (1282) (383 
j 72 3 915 1,086 32 
Subject 2 97.1 72 di (913) (1126) (11198) 
Subject 3 98.8 62 Hn ro dS (1123) 
8 7 B 2 
i; b is(n't) chas- 98.5 198 930 1,120 1,270 1,540 
Slobin, 1966 (adults) | T. p d ) chas: ei ae. ere da 
Current recoding experiment 
TA FA TP FP FD TD 
(k) +1) G2) (: + 3)| (k 44) (e T5) 
Experiment 2, Blocks | Tt is(n't) true that the 98.8 118 833 957 1099 | 1.143 | 1,310 | 1,437 
2-7 dots are(n't) red. (835) | (953) | (1,071) (1,189) | (1,307) | (1,425) 
Note. Abbreviations: TA = false affürmative, TN — true negative, FN = false negative, TP = true pred- 
icate negative, FP = false pre = true denial. 


* The number of constituent comparisons in the tru 
^ We are indebted to Bill Chase for Providing the g 


involved in the recoding operation, The 
simulation suggested that items and o 


ing the 
f p Since recod- 
ing does not involve these extra items and 


Operations, the iterative retrieval process 
does not have to be reinitialized, 


Experiment 2: Blocks 2-7 


The recoding 
the verification 
sentences 
earlier. 


model can be applied to 
of the embedded negative 
in Experiment 2, discussed 
In a series of pilot Studies in 
Which the sentence preceded the picture, 
true predicate negatives were Verified faster 
than false predicate negatives after the 
Subjects were practiced. This result was 
obtained for Blocks 2-7 of Experiment 2. 
indicating that subjects were Spontaneously 
recoding predicate negatives after one 


€ affirmative case is denoted by &. 
raphs from which the data points were e: 


stimated, 


block of trials. Therefore, the data from 
the last six blocks of trials of Experiment 2 


Were analyzed separately to test the re- 
coding model. 


The recoding of predicate 
would proceed much 
our analysis of the Young and Chase 
Study. — For example, suppose the pred- 
Icate negative sentence It's true that the 
dots aren't red was represented as [NEG, 
(RED, DOTS)] and the Picture contained 
green dots. The color Predicate red would 
be replaced by green and then the negation 
marker would be replaced by an affirma- 
üve marker, after two find-and-compare 
Operations, After recoding, the sentence 
representation jg affirmative, so there 
would only be à base number of com- 
parisons, 4. between the sentence and 
Picture “Presentations, for a total of k + 2 
Operations, [n the false predicate negative 


negatives 
as we specified in 
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condition, the recoding plus comparison 
operations result in a total of & + 3 oper- 
ations. In summary, when a recoding 
strategy is used, true predicate negatives 
should involve one less operation than 
false predicate negatives, and consequently, 
should be verified faster. The predicted 
number of operations for all six conditions 
are true affirmative, b; false affirmative, 
k--1; true predicate negative, k + 2; 
false predicate negative, k + 3; false denial, 
k + 4; and true denial, k + 5. Recoding 
was only initiated with the onset of the 
picture which provided the color predicate 
to be used in recoding. Therefore, the 
latencies should reflect the duration of both 
recoding and comparison operations. 

Results and discussion. The mean laten- 

cies for correct responses in Blocks 2 to 7 
of Experiment 2 are presented in Figure 7. 
The abscissa still represents an increasing 
number of operations—both recoding and 
comparison operations, although true pred- 
icate negatives are now hypothesized to 
involve fewer operations than the false 
ones. As before, the latencies increased 
linearly with the hypothesized number of 
operations. The slope of the best fitting 
straight line indicates that it takes about 
118 msec (standard error = 20 msec) for 
each additional constituent comparison. 
Practice effects account for this slope 
being slightly (but not significantly) smal- 
ler than the 169 msec slope found in only 
the first block. The model accounts for 
98.8% of variance among the six means, 
F (1, 115) = 127.35, p «01. The re- 
sidual 1.2% is not significant F (4, 115) 
< 1, and the root mean squared deviation 
was a small 23 msec, The error rates for 
the six conditions were low (2.806 overall) 
and correlated (r = .93) with the latencies 
as shown in F igure 7. 

i The variances in each of the six condi- 
tions also provide information that is rele- 
vant to the predictions of the model. 
The latencies in the six conditions are 
postulated to include the time for multiple 
p eius of a single operation. Therefore, 
ie ig omen should increase linearly 
n fa VE INCREASE number of iterations. 

ct, a linear trend accounts for 89% 


SENTENCE FIRST 
BLOCK 2-7 


Slope = 118 Msec/Comparison 
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Lotency (Msec) 


800 


% Error 
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K KH K+2 K+3 K+4 K+5 
(TA) (FA) (TP) (FP) (FD) (TD) 


NUMBER OF CONSTITUENT OPERATIONS 


FIGURE 7. The fit of the constituent recoding 
model to the results of Blocks 2-7 of Experiment 2. 
(Abbreviations: TA = true affirmative, FA = false 
affirmative, TP = true predicate negative, FP 
= false predicate negative, FD = false denial, 
TD = true denial.) 


of the variance among the six variances 
in the present experiment, 77% in Experi- 
ment 1, and 78% in Block 1 of Experi- 
ment 2, consistent with the hypothesis of 
a single underlying comparison operation. 

The results for Experiments 1 and 2 
show that predicate negatives and denials 
differ in two somewhat independent ways. 
First, predicate negatives have a smaller 
scope of negation, hence they take less 
time to process. Second, predicate nega- 
tives, unlike denials, are sometimes spon- 
taneously recoded. The reason for this 
difference is unclear; perhaps it is because 
the recoded constituents are adjacent in 
the internal representation of the predicate 
negative but not in the denial. Of course, 
subjects can recode a denial when ex- 
plicitly instructed, as in the Young and 
Chase (Note 2) study. 

It should be pointed out that there are 
alternative recoding strategies besides the 
one described above. However, most of 
them, like recoding the picture representa- 
tion into a negative form when it follows 
a predicate negative sentence, make the 
same latency predictions. Still another 
possibility is to recode a predicate negative 
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into an affirmative disjunction, like Ifs 
true that the dots are either black or green. 
This latter strategy is implausible since 
subjects did not report using it, and it is 
difficult to train someone to adopt it. 
What conditions encourage people to 
adopt a recoding strategy? For one thing, 
recoding is more likely if the constituent 
to be recoded has a con tradictory antonym, 
like open and closed or odd and even, where 
the negation of one is equivalent to the 
affirmation of the other. A second condi- 
tion that encourages recoding is a delay 
between the Presentation of the sentence 
and the second source of information 
(Carpenter, 1973; Trabasso, 1972). The 
delay may Provide time for subjects to 
decide whether the sentence is to be re- 
coded, to retrieve the alternative constit- 
uents, and perhaps to perform the recod- 
ing. If these conditions are satisfied, 
well-practiced adult subjects will often 
recode spontaneously. And of course, a 
Sure way to get people to recode is to 
explicitly tell them how and when to do so. 


Recoding in Previous Studies 


We can look for evidence of recoding 
Studies, particu- 


aster than false 
Slobin (1966) did 
of sentence verifi- 
tence preceded the 


false affirmative, 
true negative, k + 2. 


k+ 3. 


k+1; 
egative, 


à ts for 98.6 
of the variance among the four means e 


adults. This data is shown in Table 7 
The results for children, on the other 
hand, indicated that they did not recode 
these same negative sentences, 


3 The non. 
recoding model accounts for 91:207. 97.797. 


94.5% and 91.1% of the variance for the 
6-, 8-, 10-, and 12-year-olds, respectively. 
Thus, under the very same stimulus con- 
ditions, only adults seem to adopt a re- 
coding strategy. : 
It sometimes happens that the latencies 
for true negatives and false negatives are 
equal, implying that recoding occurred 
approximately half of the time (e.g., Gough, 
1965; 1966, Experiment 1). Sucha mixture 
of recoding and nonrecoding strategies 
would predict k operations for true af- 
firmatives, k + 1 operations for false af- 
firmatives, and on the average, k +25 
Operations for true and false negatives. 
In Gough’s studies, recoding could only be 
initiated with the onset of the picture 
which provided the predicates to be sub- 
stituted. Therefore, in both studies, the 
clocked interval should reflect the duration 
of both recoding and comparison opera- 
tions. In fact; the predicted number of 
operations account for 99.8% and 98.9% 
of the variance in Gough (1965) and Gough 
(1966, Experiment 1). respectively. This 
indicates that even results produced by a 
mixture of strategies can be accounted for 
by the model. 
We have argued that the find-and- 
compare operation in the recoding pro- 
cess is the same as the find-and-compare 
Operation in the constituent comparison 
Process, This view was Supported by 


model to account 


The com- 
ent-by-constituent 
find-and-compare process suggests that it 
IS à key operation underlying sentence 


GENERALITY oF THE Cow: 
Comparison Mopkr: 
OTHER SEM 


STITUENT 
EXTENSION To 
ANTIC STRUCTURES 
Even though the mo 
gantly account for the 
plicit negatives, it is 
wider range of sem 
demonstrate th 
More genera] t 
cesses, we will 
vious studies + 


del can very ele- 
Processing of ex- 
also applicable to a 
antic structures. To 
at the model is really a 
heory of comparison pro- 
examine a number of pre- 
hat investigated other se- 


E 


mantic structures like counterfactuals, 
implicit negatives, passives, and quanti- 
fiers to see how the model fares in pre- 
dicting verification latencies. 
Counterfactuals. The counterfactual con- 
struction describes a situation that didn't 
occur, for example, . . . if Judy had left. 
While the counterfactual conditional has 
additional meaning components as well, at 
least one comporient of a counterfactual 
clause seems aptly described by some kind 
of negation, in this case, a negation of 
Judy's leaving. "Therefore, this construc- 
tion should be amenable to the analysis 
given explicit negatives. Carpenter (1973) 
presented sentences containing either a 
factual clause, (e.g., . . . since Judy had 
left) or a counterfactual clause (e.g., . . . if 
Judy had left), and then presented a probe 
sentence (e.g., Judy left) to be verified as 
true or false. The counterfactual clause 
might be represented as [NEG, (LEFT, 
jupy)] and the probe sentence as [arr, 
(LEFT, JuDY)]. The constituent compari- 
son model predicts a linear increase in 
latencies from true factuals, to false 
factuals, to false counterfactuals, to true 
counterfactuals. In fact, the model ac- 
counts for 99.8% of the variance among 
the four means (as shown in Table 8). 
This supports the hypothesis that one 
component of a counterfactual is processed 
much like an explicitly negative clause. 
A similar approach may apply to several 
other modal constructions that have a 
negative component like Mary should have 


SENTENCE COMPREHENSION 


67 


left, Mary couldn't leave, and Mary wouldn't 
leave. Verification of the negative com- 
ponent in counterfactuals and these other 
modal constructions can be accounted for 
by the constituent comparison model 
without any additional assumptions. 

Negative quantifiers. Negation does not 
always have to be explicit and use the not 
morpheme. Often, negation is implicit and 
lexically incorporated into an adverbial. 
For example, it can be demonstrated by 
using Klima's (1964) co-occurrence criteria 
that words like few and seldom are syn- 
tactically negative. Clauses with these 
terms have the same syntactic properties 
as comparable clauses with of. Therefore, 
the model might be able to explain the 
processing of these implicit negatives. 

Just and Carpenter (1971) timed people 
as they verified sentences that contained 
negative quantifiers like Scarcely any 
(hardly any, or few) of the dots are red. 
Subjects were given a sentence that was 
negative, like one above, or a correspond- 
ing affirmative like Many (lots of, most of) 
the dots are red. The picture that followed 
the sentence was composed of an array 
of dots, containing a large subset and a 
small subset where one subset was black 
and one red. Since the ratio of negation 
time to falsification time was approxi- 
mately 2:1, these implicit negatives must 
have been represented with a small scope. 
The data were collapsed over the three 
exemplars of each type of quantifier. The 
model predicts that latencies should in- 


TABLE 8 


ox MODEL APPLIED TO OTHER SEMANTIC STRUCTURES 


1 ConstituENT COMPARIS 


lus sentence 


Reference Example of a stimu 


| — 

Counterfactuals Carpen- 
ter, 1973 

Negative quantifiers Just 
and Carpenter, 1971, 
Experiment 1 

s Gough, 1966, 

" neriment 2 : 

Semantic memory Wason 

and Jones, 1963 

Iplicit negatives Just 

and Clark, 1973 


“| If Mary would have left . - - 


Few of the dots are red. 


The car was(n't) hit by the truck. 


Seven is(n't) an even number. 


If John forgot to let the dog out., 


à "n alse 
then the dog is out, true or false? 


Percentage Observed (and estimated) latency 
of variance 
UY TA FA FN TN 
model ce | & +1] +2) |k +3) 
3 2,264 | 2.047 3,323 
9a d (2,281) | (2.635) (3,344) 
6.3 159 1491 | 1.690 1,954 
AN (1,516) | (1,675) (1,993) 
" 1,020 | 1,130 1,330 
ana 99 — | aiio | qe (1:316) 
226 11660 | 2.080 21370 
92 A (1741) | (1:967) (2/419) 
285 2814 | 3252 6 | 3,670 
946 ^ (25800) | (3175) | 65461) | (3:746) 
= false negative, TN = true negative, 


Nove. Ahiria ETA 
le. Abbreviations: 44 r v 
^ Thé number of constituent comparisons in the true 


= true affir z4 = false affirmative, F 
ive, FA = false affirmative, ~ 
irmative, F a E i 


s denoted by k. 
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crease linearly from true affirmative to 
false affirmative to false negative to true 
negative. These predictions account for 
96.3% of the variance among the four 
means (as shown in Table 8). The indi- 
vidual examplars differ in mean latency, 
but the relation among the four informa- 
tion conditions is fairly stable. Overall, 
the model provides a very creditable ac- 
count for the processing of implicitly 
negative quantifiers. 

Implicit negatives. The implicitly nega- 
tive predicate forget to presents an inter- 
esting opportunity for examining the com- 
Prehension of negation. Not only is this 
a negative predicate, but the proposition 
embedded in it is also negative. For 
example, the sentence John forgot to let 
the dog out directly implies that John did 
not let the dog out (Karttunen, 1971). 
Thus one can study how people extract 
information from the implications of im- 
plicitly negative predicates, as Just and 
Clark did (1973). In that Study (Experi- 
ment 2), subjects were presented with an 
affirmative sentence (John remembered to 
lel the dog out) or an implicitly negative 
one (John forgot to let the dog out) and then 
were timed as they verified the probe 


sentence (e.g, The dog is in) as true or 
false of the imp 


Sentence like Jo, 
is that the d 
have been represented as [nrg 


€ » (our, 
DOG)] The information from 


a rep- 
g is in, 
The model 
atencies should 


€, like The do 
DOG)]. 
at verification | 
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The data 
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the four conditions 
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model can 
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increase li 


among 


as shown in Table g 


We have detailed how this 
also explai ‘essing of 


CIL negatives like different, con. 


flict, unrelated, and except (Just & Car- 
penter, in press). . 

While we cannot specify a priori whether 
or not a word is internally represented as 
a negative, our model does provide a valu- 
able litmus test. If the verification laten- 
cies for a suspect word are shorter when 
it is false than when it is true, then the 
word is probably being processed as a 
negative in that situation. The model 
suggests that forget, different, unrelated, 
conflict, and except are internally repre- 
sented with two components—an afirma- 
tive core as well as a negative polarity 
marker. And the same kind of analysis 
can also exonerate suspect words. For 
example, small is not processed as a nega- 
tion of large; it is represented as an af- 
firmative (Carpenter & Just, 1972; Just 
& Carpenter, 1971). Thus, the theory 
provides a procedure to discriminate nega- 
tive lexical items from affirmative items. 

Active and passive Sentences, Passive 
Sentences have been the object of consider- 
able psychological interest (Blumenthal, 
1967; Johnson-Laird, 1968; McMahon, 
1963; Mehler, 1963; Miller & McKean, 
1964; Olson & Filby, 1972; Savin & 
Perchonock, 1965; Wright, 1969), and 
it seems reasonable to ask how well 
the model accounts for the verification of 
Passives, Gough (1966, Experiment 2) 
Presented affirmative or negative Passives, 
for example, The car isn’t hit by the truck, 
which may have been represented [NEG, 
(Is ur, CAR)], where (S HIT) is an ab- 


Stract Symbol that Tepresents being 
Cipient of hitti 


‘ 8a truck, represented 
as (Is urr, CAR) or (HI 


he model predicts 
Verification | 


mM true affirmative 
m oid passive to false 
e e. 
n fact, the line of [AF md 1 
nce among the four 
4 slope of 99 msec/com- 
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processing strategies in languages other 
than English, for example, languages in 
which the negative morpheme occurs at 
slightly different places in the surface 
structure of sentences? To explore this 
issue Just and Carpenter (Note 3) exam- 
ined the verification of Norwegian senten- 
ces that correspond to the ones used in 
Experiments 1 and 2. The experiments 
were performed entirely in Norwegian with 
native Norwegian speakers as participants. 
In English, the negative morpheme gen- 
erally occurs after a copula verb, in both 
subordinate and superordinate clauses (e.g., 
Tt is not true that the dots are red and It is 
true that the dots are not red). In Norwe- 
gian, the negative morpheme (ikke) occurs 
after the verb (stemmer) in a superordinate 
clause but before the verb (er) in a sub- 
ordinate clause: Det stemmer ikke at prik- 
hene er røde and Det stemmer at prikkene 
ikke er róde. The constituent comparison 
model specifies a matching process between 
semantic representations that are not di- 
rectly dependent on surface structure. 
Therefore, surface structure perturbations, 
such as this difference in the locus of the 
negatives between English and Norwegian, 
should not alter the basic processing of 
these sentences. . 
The results showed that the processing 
of these sentences in Norwegian was very 
similar to processing in English. Without 
any modifications, the constituent. com- 
parison model accounted for 97.7% of the 
variance in the nonrecoding situation 
(Block 1) and 97.9% of the variance in 
the recoding situation (subsequent blocks). 
These results indicate that the model 
generalizes to languages with other surlace 
structure Properties, and the processes are 
not peculiar to the strategies devised by 


English speakers dealing with English 
sentences, 
Answering questions, To be able to 


answer a question, a person must COM- 
m a Tepresentation of that question to 
ne other source of information, In this 
Sense, the comparison process is basic to 
Be answering tasks as well as to 
a e ication tasks. In one kind of question- 

ering task, a person is given a sen- 


tence and then asked a question about 
that sentence, for example, The truck is 
hit by the car. What hits? or What is hit? 
The representation of the sentence and 
question, together with the model of the 
comparison process, can predict whether 
questions will be easy or difficult to 
answer. 

There should be more operations and 
hence longer latencies and more errors 
when the representations of the sentence 
and question mismatch. In fact, this is 
in agreement with the data that exist on 
answering questions about active and pas- 
sive sentences (Wright, 1969) and com- 
parative sentences (Clark, 1972). In both 
of these domains, it is easier to answer 
a question about a sentence when the 
sentence and question representations have 
similar structures, for example, when both 
the sentence and the question are passive. 
Under these conditions the comparison of 
corresponding constituents is facilitated. 
Thus, the process of comparing two rep- 
resentations is basic not only to sentence 
verification but also to answering certain 
types of questions. 

Universal and particular quantifiers. The 
model has also been extended to explain 
the verification of sentences containing 
universal or particular quantifiers that 
refer to pictures of colored geometrical 
figures, for example, A1} of the round figures 
are red and Some of the round figures are 
red. In the proposed representations of 
these sentences the inner propositions were 
embedded in quantifiers. Just (1974) mea- 
sured how long people took to verify these 
two kinds of sentences, examining four 
possible set relations between the red 
figures and round figures. _Just (Experi- 
ment 3) applied a self-termimating version 
of the constituent comparison model to 
the data, and the model can account for 
01% of the variance among the means of 
the eight conditions. Thus, a model he 
serial comparison of constituents in the 
and picture representations can 


explain the processing of sentences quanti- 
xple 


4 
fied with all and some. 
dpi A PA 


1 According to the 
ing constituents 10 th 


sentence 


model, all of the correspond- 
e sentence and picture repre- 
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Semantic memory verification. It is im- 
portant to note that the constituent com- 
parison model is not restricted to sen- 
tence-picture comparisons but applies 
equally well when a sentence is compared 
to information retrieved from semantic 
memory. Just’s sentence-picture verifica- 
tion task was analogous to Meyer's (1970) 
semantic memory verification task (e.g., 
All/some dogs are animals), and so per- 
formance in the two situations can be 
directly compared. In fact, the pattern 
of verification times was very similar in 
the two situations. Similar results in the 
two domains might be explained in terms 
of a process that is common to both tasks. 
The process that compares the representa- 
tion of the sentence to some second source 
of information—be it a picture or semantic 
memory—is common to both tasks. There- 
fore, the single constituent comparison 
model proposed by Just (1974) can account 


for both sentence-picture and semantic 
memory verification. 


Affirmative and negative sentences that 
refer to semantic memory are also pro- 
cessed as the constituent comparison model 
specifies. Wason and Jones (1963) asked 
subjects to verify sentences like Seven isn’t 
an even number or Eight is an odd number. 
This task required that subjects compare 
the information in the sentence represen- 
tation with the information retrieved from 
semantic memory. Just as for sentence- 
picture comparisons, the model predicts 
that the latencies for the four information 
conditions—true affirmative, false affirma- 
tive, false negative, and true negative— 
should increase linearly. This they do 
with a slope of 226 msec/comparison with 
the model accounting for 92.195 of the 


= ee, 


sentations are compared even j 
encountered on an early 
sary because constituen 
markers can reverse the t 
However 


if a mismatch is 
comparison. This is neces- 
ts like negative Polarity 
n rey j ruth value of a Statement. 
» In situations where People know that 
there are no negations, detecting a single mismatch 


may be sufficient to determine that the sentenc 
is false. In fact, people may adopt the eflicie i 
strategy of stopping the comparison process me 
the first mismatch in tasks that employ mite 


simple, active, affirmative eclarative sen ce: 
affirm: declar; 3 
? tences 


'ariance among the four means as shown 
in Table 8. This again illustrates how 
the model is not constrained to situations 
in which a sentence is verified with respect 
to an accompanying picture. 

In both of these cases in which the 
model has been extended to verification 
of information from semantic memory, 
there has been an implicit assumption that 
there are processes retrieving information 
from semantic memory. The assumption 
has been that the duration of these re- 
trieval processes is small or constant rela- 
tive to the duration of the comparison 
process. Similarly, in sentence-picture 
verification, it is assumed that the dura- 
tion of picture encoding is small or con- 
Stant relative to the duration of the com- 
parison process. With these assumptions, 
the model can account for semantic mem- 
ory verification, as well às sentence-picture 
verification. This result indicates that the 
comparison processes are not specific to 
one situation but may be general proc- 
essing strategies that are used in a variety 
of comprehension tasks. 

The mental processes described by this 
model are not specific to the sentence 
verification paradigm but occur in a large 
number of situations that involve verbal 
comprehension (see also Carroll, 1972; 
Trabasso, 1972). These more general 
processes involve relating the information 
Írom a sentence to information from a 
Second source, such as the listener's pre- 
vious knowledge of the world. For ex- 
ample, in order to agree or disagree with 
a Statement, it is necessary to compare 
the statement to a representation of one's 
own belief. In order to answer a Wh- 
maton jae il fen te 
that Meo mined 4 Y ire. 
Compared (o ur js ence) must be 
the Duameseria vac nowiedge ess 
&lewed, Te B. mL MSN can be res 
Won n O acquire new informa- 
i rough a verbal medium, the old 
information in the communication will 
ire 8 4 Basso whieh the ne 

ol 1$ added. The determination o 
which information is old can only be 
made if the Sentence representation is 
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compared to previous knowledge.  Else- 
where, we have demonstrated how these 
comparison operations also occur in fol- 
lowing simple instructions and in various 
other instructional situations (Just & Car- 
penter, in press). Thus, the basic kinds 
of operations described by the model are 
part of a large class of comparison opera- 
tions that occur very commonly when we 
comprehend linguistic material. 


CONCLUSIONS 


This analysis of the sentence verifica- 
tion process allows us to examine various 
situations and determine what features of 
processing remain invariant. In the three 
situations we examined in detail, people 
verified simple or embedded affirmative 
and negative sentences, recoded negative 
sentences, and verified a number of rela- 
ted semantic structures like implicit nega- 
tives, counterfactuals, and quantifiers. The 
analysis suggests that several features of 
the processing are common to these vari- 
ous tasks. First, the find-and-compare 
operation appears in all three situations. 
Moreover, this operation reappears in a 
number of other simple cognitive tasks, 
such as context recall (Sternberg, 1969a) 
and same-different classification tasks 
(Posner, 1969). Second, these basic oper- 
ations seem to be performed serially since 
the latencies increase with the number of 
comparison operations that are performed. 
Third, the processing rate lies within 
certain boundaries. The estimates for the 
retrieval and comparison operation were 
primarily between 100 and 300 msec. 
However, with extended pracüce, people 
can retrieve and compare sentence CON- 
stituents at a very fast rate, perhaps as 
fast as 30 msec/operation. Fourth, the 
order in which constituents are compare 
reflects the linguistic structure of the sen- 
tence and the information structure of the 
task. And fifth, the general nature of the 
control structure remained similar, 4'* 
though not identical, in these tasks. That 
is, the constituents of two representations 
Were compared and the outcomes deter- 
Mined the subsequent flow of control. 


The analysis also indicates what features 


of the processing vary across situations. 
Since an information processing system 
adapts itself to its current task, one would 
expect differences in behavior from task 
to task. One difference was in the control 
structure. For example, in the three situ- 
ations examined, it was found that the 
structure of the task determined whether 
or not all the constituents were compared. 
Not all of the constituents need to be 
compared in order to recode a negative 
representation ; comparisons proceed only 
until the negation marker and the pred- 
icate are recoded. However, when veri- 
fying affirmative and negative sentences, 
all of the constituents must be compared. 
Another difference lies in the rule that 
relates the number of mismatches to the 
decision of true or false. In all of the 
current studies, an even number of mis- 
matches resulted in a correct /rue response. 
However, one can envision tasks where 
two mismatches would not lead to a true 
response. For example, suppose the sen- 
tence The dots aren't red were presented 
with a green triangle. Here, the correct 
response is "indeterminate." And, the 
basic incompatibility between the repre- 
sentations might be detected very quickly, 
perhaps during the picture encoding stage 
when an attempt is made to align the 
two arguments. Thus, the number of 
constituents that are compared and the 
rule that relates the number and locus of 
mismatches to the final response of true 
or false may vary from task to task. The 
role of the processing analysis is to specify 
the task structure and state how it in- 
fluences the control structure of the pro- 


cessing system. 


Sentence verification has been regarded 


in the past as a special kind of task, 
unrelated to others. The current model 
demonstrates that, on the contrary, the 
kinds of operations that occur in sentence 
verification are not specific to language 
processing. The verification task can be 
easily incorporated into the same explan- 
v framework that has been used to 
describe other kinds of tasks. Morearen 
we have argued that the comparison pr oC- 
ess is not specific to a true-false verifica- 


atory 
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tion paradigm but rather occurs whenever 
a person answers a question, follows an 
instruction, or incorporates statemen ts into 
his belief system. The main goal has 
been to show that a basic comparison 
operation, a simple organization of the 
information, and a straightforward control 
structure can account for a wide range 
of experimental results, The model should 
help place this aspect of language pro- 
cessing in the more general area of cogni- 
tive psychology. 
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Markus 


University of Michigan 


A confluence model is developed that explains the effects of birth order and 


family size on intelligence. 


is conceived of as depending on the cumulative effects of the intellectual en- 


vironment, which, for the purposes of 
siblings’ and parents’ intelligence. 


the model, consist Primarily of the 


Mutual influences, through time, on the 


intellectual development of the siblings are described by the growth param- 


S positive as well as negative effects of 


last born and the only child. The model explains several features of a large 


birth order study carried out on nearly 400,000 19-year olds. 
implications of the model are discussed, among them the effects 


tion between successive children. 


of age separa- 


In agreement with the implications, data 


on the relatively low IQ of twins and triplets are cited, Extensions of the con- 


is examined for its usefulness in explicating a general 
logical problems: the emergence of individual difference: 


A recently published review on birth 
order effects (Schooler, 1972) has all but 
put this field of research to rest. That 
article, entitled “Birth order effects: Not 
here, not now!”, concludes that “The 
general lack of consistent findings revealed 
by this review leaves real doubt as to 
whether the chance of positive results is 
worth the heavy investment needed to 
Carry out any more definitive studies” 
(p. 174). While the general tone of the 
article and of its conclusion is that of 
understated skepticism, the author ends 
the article on a cryptic, but as it turns 
out, rather prophetic note: 


On the other hand, I suspect that other investiga- 
tors, including myself, will not be able to resist the 
temptation of taking a cheap bet on a long shot by 


Schooler’s forecast about the continued 
fascination with birth order found solid 
= ae 
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The confluence model 
class of social-psycho- 
s in a social context, 


confirmation in an extensive series of 
studies concerned with the effects of mal- 
nutrition on the intellectual performance 
of children born in the Netherlands at 
the end of the second World War (Stein, 
Susser, Saenger, & Marolla, 1972). As 
an unanticipated by-product of these 
studies, Belmont and Marolla (1973) dis- 
covered a strong relationship between 
birth order and intellectual performance. 
Even the most skeptical of birth order 
researchers will find it difficult to ignore 
these findings which exhibit most interest- 
ing patterns and Which are based on no 
fewer than 386,114 data points, the entire 
male population of the Netherlands who 
attained 19 years of age in the years 1963 
to 1966. The Belmont-Marolla results 
are reproduced in Figure 1 Which reveals 
the five following features: First, in- 
tellectual performance aS measured by 
Raven Progressive Matrices declines with 
family size. Second, within each family 
size (j > 2, where J is the number of 
Children), Raven Scores also decline with 
birth order. Third, Within each family 
Size (j > 2), the last born shows a greater 
decline in intellectual performance than 
any other birth rank. For a given family 


size (j > 2), the decline from the jth — 1 
to the jth child is about three times greater 


Edd 


SH 
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than the average intellectual difference 
between adjacent siblings. Fourth, ex- 
cluding last borns, intelligence and birth 
order are related by a quadratic function: 
There is a clear indication of a reduction in 
the rate of intellectual decline with higher 
birth orders, and there is an upswing for 
families of eight and nine. Fifth, the only 
child scores at about the same level as the 


100 


score 


99 


98 


Transformed Raven 


97 


FIGURE 1. Average transfo 
(i) and family size (j), recalcul 
Raven scores were reported by Belt 


à 6 
from 1, high, to 6, low. For the purpose 
formation, Xt, = 113.45 — 5.0047X, waa js 
the scale so that increasing values now 


rmed Raven s 


Belmont © T 
ated a. Marolla in terms of six categories 
mont and ^ 


s of the prese 


firstborn of a four-child family. The mag- 
nitude of these effects is revealed in the 
fact that the highest (firstborn of two) and 
the lowest (last born of nine) scores in the 
Belmont-Marolla data are separated by 
about two-thirds of one standard deviation.! 


1 Note that in the course of transformation, a 
constant of over 100 was added to the original 


5 6 


4 
Birth order 


as a function of,birth order 
E p Marolla (1973). (The 


nt analysis a linear trans- 
n these scores, inverting 


erformed o! lligence and 


dicate increasing Inte 


setting the score of the only child at 100.) 
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FIGURE 3. Changes in the intellectual development of siblings in a four-child family. 


family is assumed to equal 100; had in- 
dividual i remained an only child until 
adulthood, /4, he would develop according 
to f(t), and his intellectual level would 
reach f (£4). When i is £, years old, however, 
there is a change in the family configura- 
tion, and because of the changed intellectual 
environment, i henceforth will develop 
according to g(t), where a, = 50 in this 
example. 

The individual's absolute 
level, Mi, at maturity can be expressed 
as the sum of the two ordinate segments my 
and mi, as seen in Figure 2, The person’s 
level at maturity cannot be estimated from 
g(ta) because g(t) describes his growth 
only past the age ty. The intellectual 
increment that accrues thereafter must 
be added to the accumulation at ^. Hence, 


g(t) must be adjusted for its intercept, and 
we have 


intellectual 


Ma, = mo m = 00)" + gy 


ta 
- e(l — BH) Q) 
Tol — erm) aq eut] 


The growth parameter o represents 
important aspect of this an 
reflects the significant cha 


B an 
alysis, for it 
nges in the 


individual's intellectual environment. But 
it also reflects the interdependent nature 
of intellectual development within the 
family context. Because he is part of the 
given environment, the individual (with 
his changed intellectual growth rate) affects 
the intellectual environment of the siblings. 
The various os within the given family are, 
therefore, interdependent and show the 
mutual intellectual influences among family 
members. It may be noted that the later 
these influences come in the individual's 
life, the smaller will be their effect—a fea- 
ture of intellectual development that is 
also reflected by the variable asymptote 
formulation. For, given M; = mo +m, 
the older the individual, the larger pro- 
portionally is mp». And therefore mı, which 
denotes the proportion of growth after a 
change in the intellectual environment; 
makes less and less of a contribution. - 
Figure 3 shows graphically how fs 
intellectual levels, M, of the ith a o 
a family of j children change a aries, 
siblings are introduced at different ows at 
The first child is born in 1950 and fidt Qo. 
à rate characterized by the puis first- 
In 1956 a new sibling arrives; ees to a. 
born's growth parameter chars ceases to 
Since, at that time the firstbo™ 


( 
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be an only child, the change from ap to a; 
is positive. This phenomenon will be dis- 
cussed fully below. In 1961 a third sibling 
is born, causing a decline in the intellectual 
environment. In 1967 a fourth sibling 
arrives reducing the level even further. 
The growth parameter in the function 
describing the intellectual development of 
the first child has thus changed from æo to 
o (in 1956), to o» (in 1961), and finally to 
os (in 1967). The firstborn continues to 
develop to maturity at the latter rate. 
Corresponding changes occur in the in- 
tellectual development of the other siblings 
in the family. Should some of the children 
leave home, or should the family acquire 
other members, for example, should grand- 
parents join the family, the growth rates 
of all family members would be adjusted 
according to the new intellectual environ- 
ment thus created. 

Estimates of the as may be obtained 
from information about individual adult 
intellectual levels and age gaps between 
siblings. To give an example using the 
aggregate Belmont-Marolla data, we know 
that the transformed Raven scores for 
Mii (the only child) and for Mi: (firstborn 


of two) were 100.00 and 100.57, respec- 
tively, when they reached 19 years of age. 
Assuming a & value of .1 and a mean gap 
between firstborn and second born of 2 
years, we obtain the following simul- 
taneous equations: 

Mii = ao(1 — e-C*09*) = 100.00 

Mis = ay(1 — et?) 

+ [e (1 — e-C»*as»?) 

— e(1 — e-60*0)] = 100.57, (3) 
which yield the solution, a = 102.780 and 
a, = 103.383. 

Different sets of as would be found by 
assuming different gaps between siblings. 
Moreover, with true individual level data 
on intelligence and sibling age separation, 
a least squares estimation procedure would 
be employed. 

The precise nature of the relationship 
between the growth parameter œ and the 
pattern of gaps can be determined only 
if empirical information is available about 
adult intelligence levels of a large popula- 
tion for which gaps are also known. On 
the theoretical level, however, the con- 
fluence model predicts different birth 
order effects for different gaps. Figure 4 


FIGURE 4. Intellectual em 
with small age separations (ab 


vironment (thin broken li 
ove) and large age 


ne) ina four-child family 
separations (below)- 
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illustrates the significance of gaps between 
successive siblings. The gaps in Figure 4, 
top, are small, and each successive child 
comes into a severely impoverished in- 
tellectual environment (indicated by the 
thin broken line). If, however, the gaps 
are substantial, the intellectual level of the 
entire family is allowed to rise and the 
newborn comes into a more favorable 
environment. If the gaps are sufficiently 
large, as they are in Figure 4, bottom, the 
intellectual level actually might show an 
increase with birth order. This conse- 
quence of the confluence model is quite 
important because there are data that do, 
in fact, show a positive relationship be- 
tween birth order and IQ (Thurstone & 
Jenkins, 1929; Willis, 1924). Because they 
constitute a major inconsistency with other 
results, data like these discouraged the- 
orizing about the nature of birth order 
effects. Unfortunately, nothing is known 


about age separations between siblings in 
these studies. 


Each graph in Figure 4 has been inter- 
sected at a point on the abscissa where the 
third born had reached some fixed age, the 
same in both graphs. In Figure 4, top, 
where the gaps are small, intellectual en- 
vironment at birth of this third child is 
approximately equal to 50. In the bottom 
portion of the figure, with the gaps ex- 
tended, it is 85. It is also obvious in the 
top portion that the declines in the quality 
of the intellectual environments for each 
successive child are smaller and smaller. 
Had the number of children been extended 
and the small gaps preserved, eventually 
the newly born child would come into a 
more favorable environment than his im- 
mediate predecessor. Assuming 1-year and 
2-year gaps, and, for illustrative purposes, 
setting the q parameters to be constant at 
100, the environments at birth of nine 
successive children were calculated, and 
they are shown in Table 1. Itis clear that 
these environments decline for the first 
seven children but that they begin to 


TABLE 1 
INTELLECTUAL ENVIRONMENT AT Birtn 


Intellectual level of sibling 
Intellectual 
Mother | Father 1 2 3 | 4 5 | 6 | . | á ^ environment 
l-year gap 
100 | 100 0 
100 | 100 1 0 66.7 
100 | 100 4 1 0 50.2 
100 | 100 9 4 1 0 41.0 
100 | 100 | 15 9 4 1 0 35.7 
100 | 100 | 22 | 45 9 4 1 ð RA 
100 | 100 | 30 | 2 15 9 4 i 314 
100 | 100 39 30 22 15 5 i 0 " 31.2 
100 | 100 | 47 | 39 | 39 | 33 is A i i EY 
2-year gap 
100 100 0 
100 | 100 4 0 66.7 
100 | 100 | 15 4 0 51.0 
100 | 100 | 30 15 4 0 43.8 
100 | 100 | 47 | 30 | 45 4 o 41.5 
100 | 100 | 63 | 47 | 3) | 4s à å AR 
Of ane e oal a ee i re! 
AS | g6 | e | s la | 15 r 52.1 
100 [39 | 9 | 4 | 5 | ae | 2 3 | 46 1 " Eon 
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improve tHereafter, revealing the quadratic 
trend that characterizes the Belmont- 
Marolla data. This is evident for gaps 
equal to 1 year as well as for gaps equal to 
2 years, although in the second case the 
upswing comes somewhat sooner. Also, 
as already noted, the absolute levels of 
intellectual environment are higher when 
the gaps are 2, rather than 1, years. Note 
that the environmental patterns shown in 
Table 1 were obtained with œ held con- 
stant. These examples, therefore, illus- 
trate the fact that temporal configurations 
of intellectual levels within the family are 
in themselves capable of explaining much 
of the Belmont-Marolla data. Thus, seen 
in that table is a quadratic decline with 
birth order, and a decline with family size 
can be readily inferred. A more complete 
account of the birth order data, however, 
requires the growth parameter, o, which 
serves three functions: It allows for scale- 
congruent numerical predictions; it can 
reflect the handicap of the last born; and 
when calculated from data having gap 
information, it can be used as an empirical 
index of growth changes that are brought 
about by changed family configurations. 
This theoretical result that gaps between 
children affect their intellectual level leads 
to the prediction that IQ scores of twins 
and of triplets should be generally lower 
than the population average. This is in 
fact so (McCall, Appelbaum, & Hogarty, 
1973; Mehrotra & Maxwell, 1949; Thur- 
stone & Jenkins, 1929). In agreement with 
previous studies, Record, McKeown, and 
Edwards (1970), for instance, report an 
average verbal reasoning score of 95.7 for 
2,164 twins and 91.6 for 33 triplets. A 
comparison group of single birth indi- 
viduals earned an average score of 100.1. 
Admittedly, other factors may influence 
intellectual attainment of children of mul- 
tiple births (Churchill, 1965; Lewis, 1963; 
Wilson, 1972). However, it is also a rea- 
sonable hypothesis that to the extent lign 
a family with two nontwin siblings and a 
famil with twins represent different in- 
ir T environments, the intellectual 
Eon » twins and nontwin siblings should 


flect these differences. For families with 
reflec 


two children, the intellectual environment 
at birth of twins is (100 + 100 + 0 + 0)/4 
= 50. For nontwin siblings, this environ- 
ment is (100 + 100 + 0)/3 = 67 for the 
firstborn, and it is obvious that it must be 
higher than 50 for the second born, regard- 
less of how short the gap and how close to 
zero the intelligence level of the firstborn 
when the second child is born. Moreover, 
each twin constitutes a large portion of the 
other's environment, and hence twins are 
less likely than other children to interact 
with their parents or other siblings. 

It should also be clear from the above 
illustrations that the level of mental 
development decreases with family size. 
Regardless of the size of gaps, the overall 
intellectual environment declines as family 
size increases because in larger families the 
proportion of individuals with relatively 
low intellectual development must be 
necessarily high. Large gaps help younger 
children surpass older children, and they 
may cause a reversal in the relationship 
between birth order and intellect. 

Tet us return to last and only children. 
It was clear from Figure 1 that intellectual 
ability declines systematically with birth 
order. But the last child constitutes a 
departure from this pattern showing à 
decline that is substantially greater than 
that of the intermediate children. This 
discontinuity can be seen better in Table 2 
in which linear regression slopes were com- 
puted for the Belmont-Marolla data. In 
nearly every case the decline from the jth 


TABLE 2 


DECLINE IN TRANSFORMED RAVEN SCORES WITH 
BIRTH ORDER FOR Last-BorN AND EARLIER- 
Born CHILDREN 


Family size 1 to (j-1) (j-1) toj 
2 — .520 
3 -270 435 
4 .257 .541 
5 .253 .666 
6 .250 .756 
7 .146 1.056 
8 :221 811 
9 212 625 

Average .230 677 


2. 
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— 1 to the jth child is considerably greater 
than the average decline for other consecu- 
tive siblings. Because of their small num- 
bers in the Population, birth ranks below 
five or six are often collapsed in birth order 
studies. Hence, the sharp decline for the 
last born as well as the quadratic trend, 
such as seen in the Belmon t-Marolla data, 
may have been inadvertently concealed in 
past birth order research. There are some 
indications that the decline of the last 
born is not limited to the Dutch data. 
Altus (1965), for example, reports Scho- 
lastic Aptitude Test data that fit the above 
pattern. Ina two-child family the decline 
from the first to the second (i.e., last) born 
is 10 for males and 20 for females. But 
in a three-child family, the decline from 
the first to the second child is only 1 and 3 
for males and females, respectively, . But 
again, the decreases in Scholastic Aptitude 
Test scores are 31 for males and 13 for 
females when the second and last children 
are compared in three-child families, 

here is another interesting 
the Dutch data. 
research on birth 
Maller, 1931; 
Note 1) the only child 


finding in 
Consistent With other 
order (Damrin, 1949, 

1963; Breland 
does not fit 


of the firstborn of a f$ 
Breland (1974) wrote recently ; 


: + + but I have found 


Were we to extrapolate from 
the firstborns of families of vari 
the expected transformed R 
the only child should have b 
Hence, the Score of 100.009 
obtained by only children in the Belmont- 
Marolla study represents a decline of 1.039 
similar to the declines of las ] 


Score of 
cen: 101.039, 
See Figure 1) 


The failure of only children to Conform 
to the pattern of birth order Tesults in the 
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Belmont-Marolla research and in 
studies has never been understood. Gen- 
erally, the only child has been regarded as 
an anomaly among firstborns (Altus, 1966; 
Belmont & Marolla, 1973; Schachter, 
1963). It has been Suggested, for example, 
that many only children come from parents 
who either cannot or will not have 


other 


more 
children, Neither condition, however, nec- 
essarily implies that the child wil] be 


handicapped. Many Parents who cannot 


have more children give their on] 
more attention ; i 


otherwise be the case, 
gone conclusion that t} 
more than one child | 
depressing effect on its į 
This leaves parents who 


Want no 
a necessarily 
ntellectua] growth. 
cannot have More i 
à Specific Physio- 
mental consequences for the į 
of the first child, Parents 
whose first child was Such an intellectu] 
disappointment th Were unwilling 
t ave << e ; 
O have ; and parents Whose first 
to begin Vith, 
rhat Proportions 
e from these various 
» and there 
Children 
from other 
There is, howe 


Ver, another Xpl: 
not necessarily 


e: 
contradictory 
depressed 


BIRTH ORDER AND INTELLECTUAL DEVELOPMENT 83 


prising if the opportunity to perform such 
a "teaching" function did not have bene- 
ficial effects for intellectual development. 
It appears from data reported by Belmont 
and Marolla that there is a fairly high 
penalty for being last and that the removal 
of this penalty augments intellectual growth 
even under otherwise relatively unfavorable 
circumstances. 

'The last born's handicap is, of course, 
reflected by the as which change with the 
changing family configuration. In fact, all 
changes in the intellectual environment 

- captured by the parameter a. Impor- 
tant new information may be obtained 
when we inquire about the way in which the 
parameter a relates to the manifest vari- 
ables of birth order data. For example, 
the Belmont-Marolla results represent 
clear effects of at least three variables. It 
was noted in Figure 1 that the major factors 
of the Belmont-Marolla data were birth 
order (7) and family size (j). However, 
it was also evident that the last-born and 
only children exhibited a decrement larger 
than expected from the overall pattern. 
Hence a third variable, \, the last born's 
handicap (including the only child as a 
last born), also had an influence on the 
data. Finally, it will be recalled that the 
Dutch data were characterized by a qua- 
dratic trend. Given these three variables 
(i, j, and à) and the quadratic trend, the 
transformed aggregate Raven scores of the 
Belmont-Marolla study are accurately 
described by the following regression equa- 
tion which accounts for 97% of the vari- 


ance: 


ar, = 101.31 — 31i 
My = 101 Hs 


+ oti — .37j + 48. 


1 ariable A is equal to 0 for last children 
i bi pé esit hs Once obtained, the as 
pent be examined for the relationship to 
: variables, allowing us to evaluate the 
un influence of these factors on 1m- 
relative 1 growth within the family. . 

aan to obtain the as, we posited a 
reasonable gap pattern 
stic purposes, in con- 
the Belmont-Marolla ag- 


The pattern assumed an 


ma 


zregate data. 


average separation between siblings of 2 
years except for the last born, who is 
separated from his closest sibling by 4 
years. Since there are1+2+... +9 
= 45 as to be evaluated and, in the aggre- 
gate data, 45 Adult Raven Scores, the 
growth parameters are obtained exactly 
by means of simultaneous equations. 

With 2- and 4-year gaps, when the 45 as 
are regressed upon the four independent 
variables in Equation 4, 69% of the total 
variance in the as is accounted for. Of the 
four variables, family size G) was sig- 
nificant at the .001 level (¢ = 8.16) and the 
last born's handicap (A) at the .02 level 
(t = 2.46). Of greatest interest, however, 
is the fact that the contributions of 7 and 
?, the two birth order variables, were not 
significant in this multiple regression anal- 
ysis. Recall in this context that, according 
to the confluence model, the simple rela- 
tionship between birth order and intellect 
may be either positive or negative, depend- 
ing on the gap pattern among siblings (see 
also Figure 4). Itisless important to know 
what birth rank the individual occupies 
than how many years separate him (or her) 
from older and younger siblings. Family 
size, however, is not an artifactual variable, 
and it necessarily depresses intellectual 
environment. As j increases, so does the 
proportion of family members with low 
absolute intellectual levels. Hence the 
intellectual environment is necessarily di- 
luted. Within a given sibship, large age 
separations between successive siblings may 
partly mitigate the negative effects of 
family size. While it is for future studies 
to determine what might be the trade-off 
between family size and age separation, it 
is obvious that the counteraction of the ill 
effects of family size by age gaps is severely 
limited by the physical constraints of 
mother's age. The larger the sibship the 
greater need be the separation between 
successive siblings. But with the size of 
sibship held constant, the sum of age 
separations between children cannot exceed 
the mother's fecund years. And moreover, 
with large gaps the overall intellectual level 
in the family will suffer because each suc- 
cessive child must spend relatively long 
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periods of his growth as the youngest of 
the family and suffer the handicap of the 
“Jast” child. 


CoNcLusroxs 


The model developed in this article Was 
based on data collected from individuals 
who had during the Dutch 
famine of 1944. Apparently, no jl] effects 
of malnutrition Were found from these 
Subjects when Comparisons of intellectual 
Performance were made between them and 
other cohorts Whose Nutritional resources 
Were Satisfactory (Stein, Susser Saenger, 

Marolla, 1972). It is, nevertheless, 
Possible that the substandard nutritional 
conditions under which the Belmont- 
Marolla subjects, many of whom are Írom 
the above sample, were reared may have 
intensified birth order and family size 
effects without changing the overal| in- 
tellectual level of the entire Population. 
Itis also Possible that because the Belmont- 
Marolla data cover the entire 


Population of 19-year-olds, Socioeconomic 
biases may have contributed to the results, 


S pre- 
sumably come from a more homogeneous 


socioeconomic background, they constitute 

added support for the Dutch findings, 
Second, the data on which the Present 

analysis was made are scores of the Raven 

Progressive Matrices test, which i : 

to be fairly free from cultural and e 


: NViron- 
mental influences. © would expect 
therefore, that verbal tests wo Id reveal] 
patterns of results that Would be ins 
Pronounced than those o nt ; ^1 
Marolla. The data analyzed b es 
Note 1) are verbal in ature l e 
Seem to be “remarkably Similar,” bie 
&stingly, Altus (1965) found birth “i 
effects in verbal but not in the qua ntitaj 


~~ 


Scores of the Scholastic Aptitude Test 
given to University of California students, 
‘ith these Cautions in mi 


“ronment within which children develop, 
This “intellectual environment” may be 
assessed simply by averaging the absolute 
intellectual levels in the given family. For 
Particular cases, of course, we may wish 
to know the precise nature o 


siblings anq parents. But for the purposes 
ol gross analysis, it Suffices to consider the 
interdependence among the 
tellectua] levels 
interdependence need not excl 
whose intellectual level may also be affected, 
however little, by the Presence of children, 
Casual observations Suggest th; i 
garten teachers i 


children’s in- 


ecrement Note th; ha Intellectua] 
environment been Tepresented not as an 


Positive effects on intelli- 


Kenge, a sult that is contrary to the 
a a this rea, including of 
m on Mont~Marol}a Study, It is 
sut Tintuitive to treat intellectual 
cme à sum, | environment 
Te One person with an IQ of 150 

rely superior to one Consisting of five 
Persons h with a I 30. It is 
Nevertheless true that environment of 
me Du each with 3n IO of 150 js 
ade IE y vironment of just one 
born ur H ema € 
level or nae an average intellectual 
Nas: "St case and of 75 in the 

Thi 


that i analysis leads to the conclusion 
. Mere are wo Primary determinants 
‘tua growt| i lded in 
fno. i SIowth F beddec 
the individuap $ » both im 


s itelfeca. 3 unos oen. 


t 


E 


BIRTH ORDER AND 


One is the spacing between children and 
ne is ing t À ; "3 
ihe other is family size. Large a 
beneficial to the younger and a a 
to the older sibling, small spacing 1S r 
detrimental to the older but more harmtu 


to the younger. Family size is damaging 
to all. In addition, the opportunity to 
serve as an intellectual resource 1s a 43 
that explains the only-child and the ast- 
child syndrome. Neither has an oppor 
tunity to be a “teacher” of others. 
The five features of the Belmont- 
Marolla data can be explained as follows: 
First, intellectual performance decreases 
with birth order but only when there is 
close spacing between successive children. 
Under these conditions each successive 
child has access to a less favorable intellec- 
tual environment. With larger gaps be- 
tween children, this pattern can be arrested 
and even reversed. Second, according to 
the confluence model, intellectual ability 
decreases with family size because the 
larger the sibship the poorer the intellectual 
environment. Excluded from this gen- 
eralization, of course, are cases where 
family size is increased by adoption of 
nearly adult children or by the inclusion of 
adult relatives (grandparents, aunts, uncles, 
etc.). However, when the family increases 
as a result of new births, the relationship 
between intellectual environment and fam- 
ily size will be necessarily negative. In 
contrast to their capacity of modifying 
birth order effects, gaps, however large, 
cannot overcome the negative effects of 
sibship size. Third, the last child shows a 
larger drop in intelligence scores because 
of a lack of opportunity to “teach.” 
Fourth, the only child also suffers from 
the "last-child" handicap. Finally, the 
upswing in the Raven scores which were 
found for the lower ranks in larger families 
is a feature of combining exponential func- 
tions, and it occurs because lower ranks 
have an environment that includes fewer 
individuals with an intelligence level lower 
than their own, while this is not the case for 
higher birth ranks. | 

If the Belmont-Marolla data represent 
ness] pattern —and Breland s (Note 1) 
(AR hat they may indeed do so— 


data suggest t 
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then it is clear from the ed: pe sis 
if one had a choice, one w ds ith the 
the firstborn of a family en iia da time. 
vounger sibling following doy nei 
The ee one acquires a younger sibling, 
thesooner one acquires the role of teacher. a 
If there already happens to be a firstborn 
in the family, then one would want to 
postpone another birth for a period of time, 
allowing the older sibling to gain a sub- 
stantial level of intellectual maturity. If 
we ignore other factors that influence 
intellectual growth, then under the con 
straints oi the confluence model, the 
parents’ dilemma can be formulated. How 
which is optimal (or fair) according to 
some criterion? If the gap is small, the 
firstborn benefits because he loses the 
"last-child" handicap early. But, the 
second born is brought into an inferior 
environment because of his older sibling's 
intellectual immaturity. The second can 
only benefit by a longer gap. But then 
the firstborn is at a disadvantage because 
he must spend a long period of his growth 
as the last born. Ignoring other factors, 
it is possible, in principle, to select a 
separation between children that minimizes 
(or maximizes) intellectual differences. It 
is equally possible to select a gap that maxi- 
mizes the average intelligence level. The 
dilemma may be exacerbated by the fact 
that a high average intelligence may be 
obtained only at the expense of large 
differences between siblings. These specu- 
lative implications, however, must be sub- 
stantiated by empirical information about 
the true effects of gaps. While the con- 
fluence model developed here leads to the 
derivation that, other things equal, gaps 
are beneficial, other things are seldom 
equal, and in reality the greater the gap 
the less the time siblings spend together— 
not only because the older is likely to leave 
home sooner but also because he engages 
in activities in which the younger cannot 
share. Hence, in practice gaps have 
optimal lengths. Studies that relate in- 
telligence to gaps (Cicirelli, 1967; Koch, 
1954; Thurstone & Jenkins, 1929) are 
based on samples that are small and do not 
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control for family 
Hence, they 
tion. They 


size 
do not allow 


ages 
The confluence model shows that en- 
vironment, even in the restricted sense 
used here, 


may have Powerful effects on 
the intellectual development 


vidual. dying that other 
factors, such as genetic background and 
other environmental] Processes not con- 
sidered here (such as child rearing practices, 
unique experiences, etc.), all 
But, although 
“ans to determine 
lation in intellectual 
by these factors and 

here, it is, neverthe- 
less, worth noting that much of the birth 
order data can be Parsimoniously explained 
by Considering no More than absolute 
intelligence levels of siblings during 


the 

Course of their develoament, 

"NN © confluence mode] need not be re- 
stricted to the growth of intelligence. 
Other processes that develop Over time 
may very well have a similar form, The 
confluence model formulates an analysis 


of a social 
a river. 


general flow, its tribut 
> and do not lose their ide 
e separate currents, one after 
join the mainstream, they affect th 
flow of the river and aret 
by uniting with it. The main feature of 
the formulation is tha 7 

considered to be a part T 
ment. And this environment is conceived 
of not as a static and Stable background 
condition, but as one that changes Over 


time, and one that is dynamically inter- 
dependent with 


As 


its components, The 
individual is continually influenced by his 
own environment, and being thus influ 


enced and changed, hims 
changes in his environmen 
very ownZchange. These 
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dent changes in the individual and in the 
environment Occur instantaneous]. 2 This 


interdependence is reflected in the simul- 


yed in the analy- 
the assessment 


point of time along a specified dimension, 
for example, intellectual level, and of the 
ever-changing environment that emerges, 
What we mean by “environment” Was the 
Surrounding intelligence whose level Varies 
as the Participants enter, leave, and change, 


As we have seen, temporal changes 


these 
follow 


a dilution Pattern, 

9 Carry out this Sort of analysis one 
requires a reliable Measure of the indi- 
Vidual quality under examination and a 
i quality over 
satisfies these require- 
a confluence analysis 
- If Srowth rates or 
not known, an 


a Suitable assump- 
tion is made, 


What Nose ae application oj the con- 
Nuence model look like in traits other 
intelligence? Clearly, the 
tween birth order 
ment Constitutes 


than the develop; 


relationship be- 
and personality develop- 
a more complex problem 
nent of intellectual abil- 
ity. One can, however, make some Specula- 
tions. With regard to traits such as 
a ee 

* [t is Possible to let 
of having it remain constant between 
births, as we did earlier, Let c 
Average of the intellect 
and adults in the give 
Then the rates of j 
the function SQ 2 a(1 — ewe 


by the differential 


@ vary continuously instead 


= 2R' (o . 

Or a given intellectual level, Mi, we have dy, dt 
= ktat) — M;(t)]. Note that the last ien 
handicap Would have to b Tepresenteq bya Separate 
Parameter, The obviou, vantages of a Serials 
* formulation are Offset p. the fact th ecause as 
now vary continuous} they can no | r be 
evaluated in relation em 


irth Order, family size, 


€ct intellectua] growth 
Tepeateg 


d Measures of all indi 

i are not available. Since t ere 

[o a quet rth order ata at present and since 

gituding Informati was not available for the 
elmont-M Tolla S 


S, a continuously changing 
Y a step function Where œ 
LY composition changed. 
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dependency, for example, it is a fair hy- 
pothesis that if the first child is in fact 


likely to be quite dependent on its mother - 


(Sears, 1950), the access to the mother by 
children born later may thereby become 
restricted. Later-born children may, there- 
fore, become less dependent than the 
firstborn because when young they learn 
to seek support from a greater variety of 
social sources, including other siblings. 
The development of dependency in the 
family context represents what seems to be 
a “preemptive” pattern: If earlier children 
are dependent, they force younger children 
into a situation that requires independence 
because the major social targets of de- 
pendence have been preempted. Other 
traits may show other patterns. Intelli- 
gence produces a "dilution" pattern. Affil- 
iation, on the other hand, may be "cumula- 
tive," with each successive child having a 
greater variety of affiliative targets and 
developing, therefore, greater affiliative 
skillss Gaps may influence the develop- 
ment of affiliative behavior, since dissimi- 
larity of ages may tend to reduce social 
contact. Status needs and dominance 
behavior are more likely candidates for 
a systematic relationship to birth order. 
If intellectual level tends to promote 
assertive behavior, and to the extent that 
earlier-born children are treated as ‘‘teach- 
ers" by younger siblings (and often given 
this responsibility quite explicitly by their 
parents), they are more likely to develop 
leadership skills. 

Not only permanent personality traits 
but also temporary individual differences 
that characterize the individuals' relations 
to a group might also be analyzed as they 
emerge in a social context. Take group 
cohesiveness, which has been conceptualized 
traditionally as some function of the attrac- 
tion of the individual members to the group. 
A group is formed. Ithas some goal. The 
nal founders are naturally very com- 


origi 
mitted to the group goal and are very 
attracted to each other. New members 


d. At the time of joining, their 
he group is lower than that 
g fathers. Therefore, by 
member dilutes the 


are recruite 
attraction for t 
of the foundin 
joining, the new 


"cohesiveness" environment of the group. 
The average attraction in this group is now 
lower than previously, and if there is a 
second recruit, he shall join a "'different" 
group than the first. And so on. We have 
a similar situation here as with intelligence 
and the family. To the extent that attrac- 
tion to the group depends on all the attrac- 
tions in that group in the same way that 
intelligence depends on intellectual en- 
vironment, and if we know the rate at 
which attraction grows and what is the 
pattern of entries, we can make predictions 
about differences in attraction of individual 
members to the group and about overall 
group cohesiveness. The study of changing 
norms and of conformity in groups with 
changing memberships may also benefit 
from the use of the confluence model. In 
fact, Jacobs and Campbell (1961) report 
experimental data on the transmission of a 
norm over "generations" of subjects that 
show a dilution pattern. However, for 
none of these illustrations, except perhaps 
for the last, is it possible at present to per- 
form an analysis such as was done for 
intelligence, birth order, and family size. 
Large sample data, together with solid 
information about rates of change, are 
required. 


REFERENCE NOTE 


1. Breland, H. M. Birth order, family configura- 
tion, and verbal achievement (Research Bulletin 
No. 72-47). Princeton, N.J.: Educational Testing 
Service, 1972. 


REFERENCES 


Altus, W. D. Birth order and scholastic aptitude. 
Journal of Consulting Psychology, 1965, 29, 
202-205. 

Altus, W. D. Birth order and its sequelae. Science, 
1966, 151, 44-49. 

Becker, G. sual acuity, birth order, achievement 
versus af tion and other Edwards Personal 
Preference Schedule scores. Journal of Psycho- 
somatic Research, 1965, 9, 277-283. 

Belmont, L., & Marolla, F. A. Birth order, family 
size, and intelligence. Science, 1973, 182, 1096- 


1101. 
Bradley, R. W. Birth order and school-related 
behavior: A heuristic review. Psychological 


Bulletin, 1968, 70, 45-51. n 
Breland, H. M. Birth order, family size, and 
intelligence. Science, 1974, 184, 114. 


SS R. B. Zajoxc AND 


Churchill, J. A. The relationship between intelli- 
gence and birth weight in twins. Neurology, 1965, 
15, 341-347. 

Cicirelli, V. G. 
IQ, and academic achievement. 
ment, 1967, 38, 481—190. 

Clarke, E. L. American men of letters, their nature 
and nurture. New York: Columbia University 
Press, 1916. 

Damrin, D. E. Family size and sibling age, sex, 
and position as related to certain aspects of 
adjustment. Journal of Social Psychology, 1949, 
29, 93-102. 

Eisenman, R. Birth order and artistic creativity. 
Journal of Individual Psychology, 1964, 20, 
183-185. 

Farina, A., Barry, H., & Garmezy, N. Birth order 
of recovered and nonrecovered schizophrenics. 
Archives of General Psychiatry, 1963, 9, 224-228. 

Gelfand, S. The relationship of birth order to pain 
tolerance. Journal of Clinical Psychology, 1963, 
19, 406-407. g 

Glass, D. C., Horwitz, N., Firestone, I., & Grinker, 
J. Birth order and reactions to frustration. 
Journal of Abnormal and. Social Psychology, 1963, 
66, 192-194. 

Green, C. E. The effect of birth order and family 
size on extra-sensory perception. Journal of the 
Society for Psychical Research, 1965, 43, 181-191. 

Innes, J. M., & Sambrooks, J. E. Paired-associate 
learning as influenced by birth order and the 
presence of others. Psychonomic Science, 1969, 16, 
109-110. 

Jacobs, R. C., & Campbell, D. T. The perpetuation 
of an arbitrary tradition through several genera- 
tions of a laboratory microculture. Journal of 
Abnormal and Social Psychology, 1961, 62, 
649-658. 

Koch, H. L. The relation of “primary mental 
abilities” in five- and six-year-olds to sex of child 
and characteristics of his siblings. Child Develop- 
ment, 1954, 25, 209-223. 

"n s Sibling position and suicidal behavior. 

ournal of Individual Psycholo 
204-207. penamey. i968, 22, 

Levinger, G., & Sonnheim, M. Complimentarity 
in marital adjustment: Reconsidering Toman's 
family constellation hypothesis. Journal of 
Individual Psychology, 1965, 21, 137-145, 

Lewis, M. M. Language, thought and personality in 
oe and childhood. London: Harrap Press, 

Maller, J. B. Size of family and personality of off- 
pyra Journal of Social Psychology, 1931, 2, 


Sibling constellation, creativity, 
Child Develop- 


Grecory B. Markus 


McCall, R. B., Appelbaum, M. 1., & Hogarty, P. S. 
Developmental changes in mental performance. 
Monographs of the Society for Research in Child 
Development, 1973, 38, (3, Serial No. 150). 

Mehrotra, S. N., & Maxwell, J. The intelligence of 
twins. A comparative study of eleven-year-old 
twins. Population Studies, 1949, 3, 295-302. 

Record, R. G., McKeown, T., & Edwards, J. H. 
An investigation of the difference in measured 
intelligence between twins and single births. 

Annals of the Human Genetic Society, 1970, 34, 
11-20. 

Sampson, E. E. Birth order, need achievement, 
and conformity. Journal of Abnormal and Social 
Psychology, 1962, 64, 155-159. 

Schachter, S. The psychology of affiliation. Stan- 
ford, Calif.: Stanford University Press, 1959. 

Schachter, S. Birth order, eminence and higher 
education. American Sociological Review, 1963, 
28, 157-768. 

Schachter, S. Birth order and sociometric choice. 
Journal of Abnormal and Social Psychology, 1964, 
68, 453-456. 

Schooler, C. Birth order and hospitalization for 
schizophrenia. Journal of Abnormal and Social 
Psychology, 1964, 69, 574-579. 

Schooler, C. Birth order effects: Not here, not now! 
Psychological Bulletin, 1972, 78, 161-175. 

Sears, R. R. Ordinal position in the family as a 
psychological variable. American Sociological 
Review, 1950, 15, 397-401. 

— R. F. oon pee and juvenile de- 

inquency. American Journ T ] 
PCS ournal of Sociology, 1934, 


Smart, R. G. Alcoholism, birth order. 
size. Journal of 
1963, 66, 17-23. 


Stein, Z., Susser, M Saen; 
Dou ?USser, M., Saenger, G., & Marolla, F. 
Te orana mental performance. Science, 1972, 

Thurstone, L. L., & Jenkin: 

me, L. L., s, R. L. Birth 

and intelligence. Journal of Educational pn 
chology, 1929, 20, 641-651. "t 
Ward, C. D. A further examin: 
as a selective factor among 


Journal of Abnormal 
"m Co SN rmal and Soc 


Willis, C. B. The eff 
s C. B, ects of pri i 
Eee capacity. Journal of ature On tins 
wae Psychology, 1924, 18 375-377 EE 
iison, R. S. Twins: E; ; 1 
Science, 1972, 175. 914.91] mental development, 


, and family 
Abnormal and Social Psychology, 


ation of birth order 
„Volunteer subjects, 
tal Psychology, 1964, 


(Received April 22, 1974) 


Teaching 
Psychology 
in 


Secondary 
Schools 


Coauthors 
RICHARD A. KASSCHAU 
and 
MICHAEL WERTHEIMER 


Presents a historical review of the 
growth and development of high 
school psychology courses in the 
U.S. In so doing, the foundation is 
‘laid for an informed discussion of 
the philosophical concerns of the 
present-day psychology teacher. 
The pros and cons of two major 
perspectives, "psychology for per- 
sonal adjustment" and "psychology 


"I3 asa discipline,” are presented anda 
u 


synthesis of these approaches is 
outlined and evaluated. This book 
also provides valuable information 
on available curriculum materials, 
references useful in obtaining these 
materials, and teacher training and 
certification curricula and criteria. 
1974. 64 pp. $2.50. 


Please note that payment must ac- 
company all orders amounting to 
$15 or less except for those orders 
submitted on institutional pur- 
chase order forms. 


American Psychological Association 


icati es Dept. 
Publication Sales 
1200 17th Street, N.W. INVA 
Washington, D.C. 20036 


Publication 
Manual 


of the American 
Psychological Association 


Second edition, 1974. Cur- 
rent guide to the preparation 
of manuscripts for APA and 
related journals. This edition 
adds many examples, includ- 
ing an entire sample paper. 
New sections on student 
papers, metrication, and 
mathematical copy. En- 
larged sections on content 
and organization of an arti- 
cle, typing the manuscript, 
and all aspects of APA style, 
including reference lists and 
tables. 136 pp. $4. 


Please note that payment 
must accompany all orders 
amounting to $15 or less ex- 
cept for those orders sub- 
mitted on institutional pur- 
chase order forms. 


American Psychological 
Association 


Publication Sales Dept. y S 


1200 17th Street, N.W. 


Washington, D.C. 20036 


Contains over 4,000 terms representing an 
authoritative, unambiguous, and hierarchically 
by Psychology and 
abulary will facilitate 


focused in accord- 
Psychologists, 


This Thesaurus, developed b 
Abstracts Information Servic 
three sections: 


y the Psychological 
es, is divided into 


The relationship Section shows the relation. 
ships among concepts that are as; 
each other. T. i 


ng rapid 
ed for indexing. 


tains 17 broad con- 
Presented in de- 
readth of Concept 
ies Correspond to 
in Psychological 


The relationship, alphabetical, and hierarchical 
Sections will assist the user in Selecting appro- 
Priate terms for different uses. 

Administrators 


* determine areas Of research fo; 


* assure Tepresentative expert 
Selection 


r an organization 
ise in personnel 


* structure a curriculum 
*Uuseasa tutorial aid 
Consultants 

* Search the literature 

® aid in retrieval of information 
Librarians 
* provide ac 


Ncepts relevant to psyc 
* focus areas of study ji Elegy 


The Thesa 


PSYCHOLOGICAL 


REVIEW 


JEAN M. MANDLER, ASSOCIATE EDITOR 
UNIVERSITY OF CALIFORNIA, SAN DIEGO 


Vol. 82, No. 2 March 1975 


Articles 


On the Temporal Control of Behavior 
Peter Killeen 


Arousal, Activation, and Effort in the Control of Atten- 
Honest m M. Karl H. Pribram and Diane McGuinness 


Theoretical Notes 
Generation-Recognition Theory and the Encoding 


Specificity punoinie, TEM E Edwin Martin 
Features and Semantic Memory: Set-Theoretic or Net- 
WOIKSMOGeI?I oso RR AE E. y James D. Hollan 


Set-Theoretic and Network Models Reconsidered: A 
Comment on Hollan's "Features and Semantic 


Memory" 
Lance J. Rips, Edward E. Smith, and Edward J. Shoben 


Metabolic Hormones and Regulation of Feeding: A 
Reply to Woods, Decke, and Vasselli 
Jaak Panksepp 


Insulin, Growth Hormone, Body Weight, and Feeding: 
A Reply to Panksepp l 

Stephen C. Woods, Elisabeth Kaestner, 

and Joseph R. Vasselli 


89 


116 


150 


154 


156 


158 


165 


rrr ee E E 


Published bimonthly by the American Psychological Association, Inc. 


Editorial Consultants 


The editor gratefully acknowledges the efforts of the following 
scripts for this issue, They are Part of a large grou 


consultants who helped review manu- 
the editing of manuscripts for Psychological Review. 


P of psychologists who have kindly agreed to assist in 


NORMAN ADLER Davip H. KRANTZ ROBERT G, PACHELLA PuiLiP Terr ELBAUM 
JonN R. ANDERSON Howarp LEVANTHAL Davin E. RUMELHART RicHARD F, THOMPSON 
NORMAN H. ANDERSON Lean L. Licur THEODORE SARBIN Roc E. WALLEY 

A. M. COLLINS JOHN Morton JOAN G. SonGrass RICHARD E. WHALEN 
IRVING L. Janis LLoyp NAKATANI Jons E.R. STADDON FREDERICK WIGHTMAN 
H. M. JENKINS 


GEORGE WOLFORD 


any area of scientific 

e Papers that advance theory rather than review it, and to statements 

that are specifically theoretical ra; i € literature, problems of method 

i i inarily not appropriate. Letters 
theoretical interest should be submitted in duplicate, 

ns for the Preparation of Manuscripts appear in the Publication 

anual of the American Psychological A lation (2 d Submit three copies; one Copy must be the 

In order to facilitate blind re- 

Stract, but each copy of the manu- 

page showing the title and the author's name and affiliation. Footnotes 

-. Figures should be submitted as glossy prints 

may be photocopies, Metric equivalents must be given for all 

nonmetric measurements, Authors should keep a Copy of the manuscript to guard against loss, 

Abstracts: Manuscripts must be 


1 t accompanied by an abstract of 100 to 175 words typed on a Separate sheet 
of paper. Detailed instructions for Preparation of abstracts appear in the Publication Manual or may be 
obtained from the Editor, 


Script should include a Separate title 
that identify the author should be t 


Manuscripts: Submit manuscripts in triplicate to the Editor, GEORGE MANDLER, De 
fornia, San Diego, La Jolla, California 92037 according to instructions Provided 
Copyright and Permissions : All rights reserved, Written Permission must 

tion for copying or reprinting text of 


ained f, 4 , 
more than 500 words. tables, or figures ior the American Psych 
ike permission of the author, inclusi i 


partment of p, 
above, 


5ychology, University of Cali- 


ce their own articles. Address 


venteenth Street, N.W., 
Subscriptions: Subscriptions are available PA calendar 


$15 domestic, $16 foreign, $3 single issue, 
Reprints: Authors may order reprints of their articles from the Printer when they receive galley proofs. 
Back Issues and Back Volumes: For information Tega, a 


tding back issues or back volumes write to Ord, 
Microform Editions: For information regarding mier, foi i 
New p, Greenwich, Connecticut 068I ta Microjjations write t 


year basis on] nuary 
PA member Tate: $6. Weanuan rough December), 


Write to Subscription Section; A pA member Jureioripres 


: Johnson Associates, Inc., P.O, 
ior Princeton Microfilms, Princeton, 


postage. 


Published bimonthly (beginning in January) in one volume Per year, six jug BY th , 
2 venteenth Street, N, W., Washington, D, C. 3 36 : SA: PY the American p. 
Hus mailing offices, 00 Printed in the U.S.A, Second-class Posta 


inp Tt address changes ayi yi 
writing that they will n pul act jcblaced; Second- 


lass. forwarding 


ge nological Association Inc, 
ge paid at Washington, De 


Ha&oLD P. VAN Corr ANITA DEViyo DEBORAH S FE 
5 " 2 WEIT d ZLDE 
Managing Editor Executive Editor and Supervising Teel ae Mary Grut, 
eneral Manager Editor "cal Technical Editor 
LYNN D. Spree ANNE Repy 
Manager, Publications ubscripti i) 
usiness Services | HBUODSNE 


anager 


— 


Copyright © 1975 by the American Psychological Association, Inc. 


Vor. 82, No. 2 


On the Temporal Control of Behavior 


Peter Killeen 
Arizona State University 


A series of experiments ranging over a number of animal species, several 
classes of behavior, and a variety of experimental conditions provided the 
foundation for a mathematical model of the temporal control of behavior. 
The initial research explored the general activity of pigeons during the 
interval between food deliveries; most of the variance in that data was 
accounted for by the algebraic sum of two exponential functions. Parameters 
of the model were interpreted as indices of arousal level and inhibitory 


processes which detracted from that level. 
ons when behavior was modified by the 


ways appropriate to those interpretati 


The parameters changed in 


action of various drugs, reinforcement contingencies, and competing behaviors. 
The model was mapped onto a series-latency mechanism and extended to 
account for various schedule-induced, operant, and respondent behaviors. 


Perhaps the most basic control procedure 
in the study of conditioning is the periodic 
presentation of the reinforcer—or uncondi- 
tioned stimulus—independent of the organ- 
ism's behavior and in the absence of any 
associated stimulus change. Such procedures 
have given rise to a number of theoretically 
important discoveries. Pavlov (1960) found 
conditioning to occur readily under such cir- 
cumstances and called the process temporal 
conditioning because of the inferred temporal 
conditioned stimulus. Skinner (1948) found 
that walking and other gross movements in- 
creased in frequency under such circum- 
stances. He called the process superstitious 
conditioning, citing the fortuitous contingu- 
ity between various responses and the pe- 
riodic reinforcer. Campbell and Sheffield 
(1953) employed the paradigm to study the 


increase in spontaneous activity that was said 


MM 
The author is indebted to Arizona State Univer- 
or support through several summers. 


ity f ; 
si Requests for reprints should be sent to Peter 
Killeen Psychology Department, Arizona State 
University, empe, Arizona 85281. 
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to occur with deprivation. They found that 
the increased activity of rats actually re- 
flected a hyper-reactivity to normal stimuli ; 
the effect of hunger “is to augment the nor- 
mal activity responses to novel stimuli and 
to greatly augment the activity to stimuli as- 
sociated with the performance of the consum- 
matory response” (Sheffield & Campbell, 
1954, p. 99). Amsel and Work (1961), 
Slivka and Bitterman (1966), and Zamble 
(1967) have also shown that the presenta- 
tion of stimuli that have been paired with a 
reinforcer increases general activity. Such 
studies have contributed impetus to the re- 
cent movement away from drive-reduction 
theories and toward incentive-motivation ac- 
counts of behavior (cf. Bindra, 1970; Bolles, 
1967; Cofer & Appley, 1967). Recent stud- 
ies of autoshaping (e.g, Brown & Jenkins, 
1968) are the logical extensions of this line 
of enquiry. 

Not all activity during periodic feeding 
schedules is associated with stimuli occur- 
ring prior to reinforcement ; hyperreactivity 
to stimuli occurring after reinforcement has 
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also been observed.  "Schedule-induced" 
polydipsia, wheel running, gnawing, aggres- 
sion, and pica (Falk, 1972; Segal, 1972) ex- 
emplity this type of "adjunctive" activity. 
Staddon and Simmelhag (1971) have also 
reported unreinforced sequences of behavior 
recurring throughout the interval between 
feedings. 

Activity has been variously taken as an 
integral part of the normal conditioning pro- 
cess (Sheffield, 1966), as an adventitious 
by-product of conditioning by contiguity 
(Skinner, 1948), as a sign of frustration 
(cf. Scull, 1973), and as an adjunctive be- 
havior induced by a schedule oí reinforce- 
ment (Levitsky & Collier, 1968). Most of 
these evaluations have been made within the 
context of one or another theoretical para- 
digm; few have incorporated extensive para- 
metric measurements of activity ; none have 
adduced an adequate quantitative description 
of the course of activity over time. The fol- 
lowing experiments demonstrate that an in- 
crease in activity level is as fundamental a 
consequence of reward as is the acquisition 
of learned responses; that activation is not 
reducible to the law of effect; and that ac- 
tivity increases to a maximum soon after a 
signal of impending reward and then de- 
creases as a function of interference by com- 
peting responses. These and other implica- 
tions are drawn from an empirically derived 
model of activation, An extension of the 
model adequately describes the temporal 
course of typical operant and respondent 


behaviors and of schedule-induced behaviors 
such as polydipsia, 


ACTIVITY AS A Function or 
INTERFOOD INTERVAL 
Experiment 1 


Method. Four White Carnea 
previous histories of experimentation, served as 
subjects. The wooden experimental chamber was 
46 cm wide, 31 cm deep, and 34 cm high. A feeder 
that dispensed mixed grain was centrally located 
on the widest wall, and a houselight was located 
directly above it near the ceiling. This chamber 
was placed in a sound-resistant enclosure, which it- 
self was located in a room separate from the con- 
trol and recording equipment. The floor of the 
chamber consisted of six movable Panels, 


ux pigeons, all with 


three 
hinged at the rear of the chamber and three hinged 
at the front. Microswitches under these panels 


indicated when the animal moved from one part oí 
the chamber to another or shiíted its weight while 
staying in one part of the chamber. Each activa- 
tion of a microswitch counted as a single "activity 
response." Data írom the six switches were 
pooled with no account taken of where in the 
chamber a response occurred. 


The pigeons were reduced to 80% of their free- 
feeding weight and exposed to various feeding 
schedules. On these schedules the food hopper 
was activated for 3 sec at various interfood in- 
tervals (25, 60, 120, 200, and 400 sec). Half of 
the animals experienced the intervals in an ascend- 
ing series, and half in a descending series. To pre- 
clude adventitious reinforcement of pacing, food was 
delivered no sooner than 5 sec after the last re- 
corded activity in the chamber (5-sec Differential 
Reinforcement of Other Behavior, or DRO). Ten 
daily sessions were conducted at each interfood in- 
terval. Each session lasted for 60 feedings, except 
at the longer intervals where sessions were termin- 
ated after 3 hours. A stepper distributed responses 
to one of nine counters as a function of the time 
since the last delivery of food. 

In a second experiment (1b) four naive pigeons 
experienced interfood intervals of 5, 10, 15, and 30 
sec, according to a Latin square design. For thesc 
animals no DRO contingency was employed. 


Results 


The average response rate for each animal 
in each condition may be found in Table 1. 
Figure 1 shows the rate of activity as a func- 
tion of the time since the last delivery of 
food. The data are averages (geometric 
means) over four animals, averaged over the 
last five sessions at each condition of Experi- 
ment la. The abcissa has been normalized 
for convenience in presenting the data. 

The data for the three shorter interfood 
intervals all show a pronounced increase in 
activity after feeding, an increase that is 
maximal one-fifth of the way through the 
interval. The overall rate of activity 15 
Sreatest for the shortest interfood intervals 
and decreases monotonically with increasi”"S 
interfood intervals, 

. Response rates were substantially higher 
m Experiment Ib, but the time-course of 2C 
tivity through the interfood interval W4S 
x cd Activity was maximal one-quarter 
y into the interval and was minim: 
ely after feeding, The decrease ” 


before feeding was not nearly 99 
marked as in Experiment la, presumably be- 
cause no DRO 


contingency was employed. 
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TABLE 1 


INTERFOOD INTERVAL, RESPONSE RATE, AND 
PARAMETER VALUES FOR EQUATION 1 IN 
EXPERIMENTS la AND 1b 


: i Re- 
eho aKsrves iC A c 1 ss 
minute 

Experiment 1b 
5 61.2 | 600 37 | 24 | 76 
10 63.6 | 318 46 | .20 | 80 
15 58.4 | 185 S5 [46 |} d» 
30 55.1 130 .86 | .16 | 89 

Experiment la 
25 21.0 | 134 .30 | .13 | 24 
60 15.6 47 Ad | .07 | 18 
120 10.7 32 .46 | .07 | 12 
200 74 10 | 1.63 | .02 | 16 
400 5.7 7 | 3.42 | .00 | 22 


a Number of parallel processes necessary to account for the 
data if they are assumed to derive from a series latency 


mechanism. 


Discussion 


A surprising amount of general activity 
occurred in this feeding situation despite con- 
tingencies that reinforced quiescence. Other 
investigators have also noted the difficulty of 
getting animals to stand still using food as a 
reward (Blough, 1958; Breland & Breland, 
1966). The situation is the inverse of that 
dealt with by Bolles (1970b) and Keehn 
(1972), who noted the ease of getting rats 
to “freeze” in the context of occasional elec- 
tric shock. Bolles proposed the existence of 
various “species-specific defense reactions,” 
such as freezing, fleeing, and fighting, which 
were especially easy to condition with aver- 
sive stimuli, It may well be that the floor 
panels of the chamber used in these experi- 
ments provide a gross measure of “species- 
specific appetitive reactions,” behaviors that 
are either induced by the recurring food de- 
liveries, or are easily conditioned through 
adventitious pairing with food. The latter 
notion is suspect, however, since the 5-sec 
DRO was specifically designed to minimize 
superstitious behaviors. . 
Tt is curious that in both experiments ac- 
tivity decreased before the delivery of food. 
This decrease was arranged by the DRO 
contingency of the first experiment, but in 

casement. ib there was no such contin- 
Experiment- ossible that the motivation 
gency. Itis P 


generated by food is simply exhausted by 
the end of the interfood interval. This is 
unlikely, however, for all of the curves peak 
at the same relative time after food delivery 
and fall to the same level before the next 
delivery. Were these changes coincident on 
an absolute time scale, the exhaustion of a 
reserve would be a plausible explanation. 
Their coincidence on a relative time scale 
denotes a greater dependence on the schedule 
of food delivery than on the aftereffect of 
any single delivery. 

A more likely explanatio for the decrease 
in activity is that competing behaviors, in- 
compatible with general activity, are elicited 
by the temporal proximity of food. These 
activities might include consistent orienta- 
tions to the front of the chamber, pecking at 
the front wall, and investigation of the mag- 
azine aperture. Such behaviors have been 
observed at the end of a free-food interval 
by Staddon and Simmelhag (1971), and at 
the end of a conditioned stimulus by Zamble 
and Kirkwood (1969); they would clearly 
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Ficure 2. Hypothetical curves representing: A 
constant asymptotic rate of A Tesponses per minute 
(solid horizontal line), general activity with no 
competing terminal behaviors (rising dashed curve, 
from Equation 3), terminal behaviors suffering no 
inhibition by reinforcement (falling dashed curve, 
from Equation 4), and the time course of general 
activity (bitonic solid line, from Equation t); 


interfere with general activity as measured in 
the present experiment, A more careful 
analysis of prefood behaviors is conducted 
in the second series of experiments. 


A MATHEMATICAL Mopzr 


The theoretical cu 


Tves accompanying the 
data are derived fro 


m the equation 
R= A(et/o — eut). a] 
is rate in re- 
S the base of 
Values for the 
Which were de- 
quares program, 


The Proportion 
measured by ¢ 


The parameter 


The dependent variable, R, 
sponses per minute, and e į 
the natural logarithms, 
parameters A, C, and T, 
rived by an iterative least-s 
may be found in Table 1, 
of the interfood interval 
ranges between 0 and 1.0, 
A is a scale factor, affected by the units 
of measurement and the sensitivity of the 
system. Of greater theoretical importance, 
A provides a measure of the maximum level 
of activity engendered by the feeding sched- 
ules. In the absence of competing behaviors 
or inhibitory stimuli, Equation 1 would pre- 
dict a constant rate of A panel activations 
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per second (cf. top solid line, Figure 2). If 
from that constant rate we subtract 


R= Ae“, 


[2] 


we obtain 
R- A(1—et'1), [3] 


whose locus is traced by the rising dashed 
line in Figure 2. Parameter I governs the 
rate at which the response curve approaches 
maximum (A) ; the smaller I, the steeper the 
ascending part of the curve, Parameter I 
is interpreted as a measure of the inhibition 
of activity by food delivery which is believed 
due to the role of food in signaling a period 
during which the next food delivery is un- 
likely (cf. Jensen & Fallon, 1973; Siegel & 
Domjan, 1971; Staddon, 1974). 

Consider now the equation 

R = Act, [4 

Which is traced as the falling dashed line in 
Figure 2. This curve accurately represents 
the right tail of the activity curve, approach- 
ing O as ¢ grows large. The decrease in ac- 
tivity before the delivery of food (at t= 1.0) 
is presumed to be due to the presence of 
competing behaviors which occur toward the 
end of the interfood interval and which in- 
terfere with general activity. In Experiment 
2, the rate of Some terminal behaviors is 
modified directly by reinforcement, and the 
predicted changes in C are witnessed. 

When these two basic processes are joined 
—the fast acting inhibition of Equation 2 
being subtracted from the slower decay of 
Seneral activity modeled by Equation 4— 
Equation 1 and its curve at the bottom of 
Figure 2 result, Equation 1, which was 
empirically derived, has close relatives in at 
least two other formal systems, control the- 
ory and stochastic latency models. 

Control, or cybernetic, systems are those 
which employ some form of feedback to 


moderate the output of the system (cf. Mc- 


Farland, 1971; Milsum, 1966) Examples 
Tange from servo 


. "mechanisms to thermo- 
Stats, and in humans include the gamma- 
efferent control of limb movement and the 


mechanisms which govern pupil aperture. 
Random pulses to such homeostatic sys- 
tems will 


cause momentary disruption, fol- 


lowed by asymptotic return to the set-point. 


—Ss 


= 
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One of the simplest models of this asymptotic 
return is provided by the exponential func- 
tion. The impulse carries the system to some 
level A, and this is followed by a return to 
the reference level whose course is described 
by an equation such as 4. Immediately after 
the impulse (t = 0) the system is at Level 
A, and as £ grows large the function ap- 
proaches 0. The rate of return to reference 
level (0) is governed by C, which is called 
the “time-constant” of the system. When 
t — C, the system has completed 63% of its 
return to equilibrium. 

Equation 1 models a more complicated 
system in which the disruption is not max- 
imal until some time after the impulse. An 
analogue is found in the striking of a very 
elastic ball. The energy of the impulse is 
first absorbed in deforming the ball, and 
only moments later is effective in propelling 
the ball through space. Equation 1 repre- 
sents a second-order control system with two 
time constants, C and I. If the restoring 
force, 1/C, were 0 (cf. Equation 3), the sys- 
tem would some to rest at Level A and 
would be 6396 of the way to that level after 
a time equal to I. For Cs less than infinity, 
one contends with the difference between two 
exponential curves, one tending to carry the 
system asymptotically to Level A and the 
other tending to carry it asymptotically back 
to the reference level. The form of the re- 
sultant curves depends on the values of the 
time constants, with those found in Figure 1 
being typical examples. 

The reciprocal of the geometric mean of 
C and I defines the "resonant," or "natural" 
frequency (o) of a control system. This in- 
dex remained approximately constant at 54 
in Experiment la and at 3.2 in Experiment 
1b. The value of o reflects the responsive- 
ness of the system ; since the obtained values 
are “faster” than the input frequency (ie., 
the rate of feeding, here given a value of 1), 
the level of activity should rise to its max- 
imum and drop back to equilibrium relatively 
Tn the normalized coordinates of 


e constancy of e indicates that 


frequency of the system is a 
f the actual interfood interval 
“stays in-tune" with its feed- 


The cybernetic interpretation 


quickly. 
Figure 1. th 
the natural 
linear function © 
—the organism 
ing schedule. 


of Equation 1 will be further examined later 
in the General Discussion section. 
Equation 1 is also found in the analysis 
of stochastic latency mechanisms (McGill, 
1963), as one instance of a “general-gamma 
distribution” (McGill & Gibbon, 1965). If 
an event such as “Response A” has a fixed 
probability of occurring in any small unit of 
time, then the probability that the response 
will occur by time £, f(£) has a geometric 
distribution that is approximated by the 
exponential : 
f(t) Ae. [5] 
Consider now a series latency mechanism 
wherein the response latency has two com- 
ponents, 
t= d de. [6] 


The values / and f» are random delays asso- 
ciated with the two components which must 
be completed in sequence before a response is 
recorded, and 


f(t) = e-tlt/T, [7] 
f (to) = e-t'c/C. [8] 

Then 
i)» gie em. [9] 


Equation 9 resembles Equation 1. although 
each describes a quite different set of data— 
the former changes in response rate, the 
latter a probability density function. The 
curves may be made congruent by assuming 
that each panel activation is the outcome of 
a unique two-component series latency mech- 
anism (Equation 9) and that many such 
mechanisms operate in parallel. The random 
completion of each of the two-step processes 
throughout the interval will result in a single 
measured response; the curves of Figure 1 
result from the completion of many such pro- 
cesses. The first event in this two-step pro- 
cess, specified by Equation 7, need not be an 
observable response or any sort of response 
as those are normally construed. It may be 
conceived as the timing-out of an internal 
clock which signals the end of a quiescent pe- 
riod. There ensues a second period of time 
terminated by a recorded response. 

One of the quick returns on an investment 


in stochastic latency mechanisms is an esti- 
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mate of the number of parallel processes 
necessary to account for the data. Since 
each process will generate a function with a 
coefficient of 1/(C — I), n processes are 
necessary to generate a function with a coeí- 
ficient of A, where n = A(C— I). Values 
of n are listed in Table 1. It can be seen 
that the estimates of n range around 80 for 
Experiment 1b and around 18 for Experi- 
ment la, Because A incorporates a scale 
factor, the estimates are unique up to a 
multiplicative constant, The estimates show 
no dependence on interfood interval but do 
Show a large increase in Experiment 1b, 
Where activity was no longer suppressed by 
a DRO contingency. The correlation of the 
hypothetical gamma processes with observed 
behaviors and their employment as minimal 
units of behavior are the types of potential 
benefits which might justify the assumptions 


of underlying series latency mechanisms, 
Tk A n CHTIO Ta sicci Ts aree] to ‘re se, 
latency mechanism, Equation 10 results ; 
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t=M. Equation 11 will subsequently be 
used to describe the time course of various 
adjunctive and terminal behaviors. 


" 
CoNCURRENT AND CoMPETING OPERANT 
BEHAVIORS 


The parameter C has been identified as 
descriptive of behavior occuring predom- 
inantly near the end of the interval. The 
smaller C, the more quickly activity subsides 
to near-zero levels. In the first experiment, 
a concurrent operant response—key pecking 
—is established. Insofar as this response is 
incompatible with general activity, a further 
suppression of responding should occur near 
.the end of the interval and should be re- 
flected in a decrease in C. In the second 
experiment, general activity is first sup- 
pressed with increasingly severe DRO con- 
tingencies and then is reinforced with a 
positive contingency between activity and 
feeding. 


Experiment 2a 

Two pigeons were trained to peck at a 
Gerbrands response key centrally located 
above the hopper in the experimental cham- 
ber, the first key peck to occur 60 sec after 
the last reinforcement activated the hopper 
for 3 sec. 

The introduction of a contingency requir- 
ing a key-peck response appeared to have 
had little effect on the pattern of general 
activity, although the overall level of re- 
sponding was lower than that obtained for 
the same animals during Experiment la. 
But examination of the parameters leads to 
a different conclusion. A and I did not 
change from control values (54 and. .09) 
when the concurrent response was intro- 
duced, but C decreased from .98 to .24. 
The lowering of activity level was attribut- 
able solely to a change of the parameter 
governing the rate of return to equilibrium. 


Experiment 2b 

Three naive pigeons were maintained at 
8096 of their free-feeding weights. Three 
econds of access to grain, normally occur- 
s 


i in invariant if 
ters were said to remain invarian 3 
: a ica to vary, the data variance accounted 
ona ite model increased by less than 1%. 
or by 
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ring every 60 sec, could be delayed by panel 
activations occurring during the last few 
seconds of the interval: DRO contingencies 
of 0, 5, and 12 sec were employed in ascend- 
ing order. 

In the last condition of this experiment, 
the DRO was replaced with a fixed-ratio 
schedule. After 55 sec had elapsed, each 
panel activation would count toward rein- 
forcement, which was delivered after 10 ac- 
tivation responses. The purpose of this 
scheduling was to encourage high response 
rates while attempting to maintain constant 
the interfood interval. 

Results. For all conditions except fixed 
ratio, rate was greatest about one-fifth of 
the way through the interval. Overall 
rate of responding decreased with increases 
in DRO length (cf. Table 2). The pres- 
ence of a DRO contingency affected the 
shape of the curves, but even without that 
contingency, activity decreased before rein- 
forcement. The fixed-ratio contingency in- 
creased response rates, but the shape of the 
activity curve was not the familiar "scallop" 
or the “break and run" usually obtained on 
ratio schedules. Instead, activity was greatest 
about halfway through the interval and de- 
creased slightly before reinforcement. These 
data are essentially identical to those col- 
lected in a pilot study with experienced 
subjects. 

Once again the data have been fit with 
curves derived from Equation 1, with param- 
eters listed in Table 2. The radical changes 
in terminal contingencies have no effect on 
I. Changes in the terminal contingencies 
are correlated with changes in C, which de- 


TABLE 2 


EXPERIMENTAL CONDITION, OBTAINED INTERFOOD 
INTERVAL, AND PARAMETERS OF 
EQUATION 1 FOR EXPERIMENT 2b 


meu — | PeT 
contingency interval A S T 
(secs) 
DRO 0 60 39.7 87 .05 
DRO 5 67 36.6 .68 .05 
DRO 12 75 24.1 .52 .05 
Fixed ratio 10 98 22.5 7.14 .05 


Note. Abbreviation: DRO = Differential Reinforcement of 
Other Behavior. 
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Ficure 4. Rate of panel activation in Experi- 
ment 3a before blackout (.0 to .5) and before rein- 


forcement (.5 to 1.0). (The dashed line represents 
the locus of the control data.) 


creased with increasing DRO requirements, 
and increased markedly under the fixed-ratio 
contingency. 

In Experiment 2a, although the level of 
general activity decreased upon introduction 
of the competing behavior, the theoretical 
asymptotic level of activity (A) did not 
change. In the present experiment, the 
value of A did vary with changes in the 
terminal contingencies, although the values 
were not monotonically related to t 
in terminal response rates, 
that changes in A were due 
tions in the obtained interval, for imposition 
of the DRO contingencies decreased the rate 
of reinforcement (see Table 2), and a fur- 
ther decrease occurred when the fixed. 
contingency was introduced. 

Discussion. ‘These experiments show that 
activity may be suppressed, either by the in- 
troduction of competing behaviors 
direct punishment of that activity, Equa- 
tion 1 remains an appropriate model, with 
the effects of these manipulations Primarily 
manifested in one parameter, C, 


he changes 
It is probable 
solely to varia- 


-ratio 


or by 


EFFECTS or INTERPOLATED STIMULI 


The curves in Figure 1 are reminiscent of 
curves drawn by Konorski (1967) to depict 
the “hunger drive” which occurs in anticipa- 
tion of food, is suppressed during eating, and 
is “restored with rebound after the [eating] 
response is over" (p. 46). Their nature is 
more complex, however, than the simple 
model proposed by Konorski. The peak 
"rebound" of these curves does not bear a 
fixed temporal relationship to the eating re- 
sponse: Instead, the maximum occurs a fixed 
proportion of the distance until the next 
feeding. This anticipatory nature of the 
data seems to rule out a simple reflexive 
interpretation, such as Konorski's. 

In the first series of experiments two ef- 
fects of reinforcement were thoroughly con- 
founded. The delivery of food was both the 
main source of energy or motivation for the 


subjects, and thereby undoubtedly had direct 
effects on their behavior, and it was also a 


signal with a fixed temporal relationship to 
the subsequent reinforcement, and thereby 
undoubtedly acquired incentive motivational 
(Bindra, 1968) or inhibitory (Staddon & 
Innis, 1969; Staddon, 1970) properties, In 
the following series of experiments additional 
stimuli are added in various temporal rela- 
tions to reinforcement, and are found to have 


patent inhibitory and excitatory effects upon 
the course of activity, 


Experiment 3a 
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creases quite dramatically, peaking at a rate 
higher than that found after food, and even 
higher than the maximum rate during 60 
sec interval pretraining of these subjects. 

The present data seem consistent with 
studies of the “Frustration Effect” (Amsel, 
1962) or of the patterning of running in 
studies of alternate reinforcement in straight 
alleys (Capaldi, 1966, 1970). In both cases, 
running speed is greater on the trials after 
nonreinforcement than after reinforcement. 
Indeed, the present curves might easily be 
taken for running-speed data, for they show 
the characteristic acceleration and decelera- 
tion associated with the movement of a rat 
in the confines of an alley (Amsel, 1972). 

The decrease in rate before reinforcement 
is similar to that found in all of the experi- 
ments reported thus far, and, as in those 
cases, is most likely due to terminal behaviors 
such as “hopper-tending” that are elicited 
by the temporal proximity to reinforcement. 
The decrease in rate before blackout is a 
more curious affair. Blackout will occur no 
matter where the animal is in the chamber, 
and there are no behaviors which might pos- 
sibly increase the value of that stimulus 
change, as hopper-tending might increase the 
amount of food the animals can obtain. 
Since pigeons tend to "roost" in the dark, 
it is possible that roosting behavior was con- 
ditioned to occur during the preblackout 
period. A more likely explanation is that 
the two intervals were confused by the 
pigeons, and the prereinforcement behaviors 
generalized to the preblackout period. These 
hypotheses are examined in the following 
two experiments. 


Experiment 3b 


The decrease in responding before black- 
out in Experiment 3a might have been due 
to the generalization of prereinforcement 
behaviors into the preblackout period. To 
test this possibility, the two periods were 
made quite discriminable : the houselight was 
turned off after 60 sec had elapsed since re- 
inforcement and was left off until the end of 
the next reinforcement. Thus, during the 
first half of the 120-sec interfood interval the 
chamber was normally illuminated, and dur- 
ing the second half it was blacked out. The 
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Ficure 5. Rate of panel activation in Experi- 
ment 3b before blackout (.0 to .5) and during black- 
out (.5 to 1.0). 


subjects and apparatus were the same as in 
the previous experiment. 

Results. Figure 5 shows that the clear 
differentiation of the intervals eliminated the 
downturn in the postreinforcement activity 
curve: The animals increased their activity 
up until the blackout (at 5096 of the in- 
terval) and then decreased it until the de- 
livery of reinforcement. Note furthermore 
that the pigeons were not stationary during 
the blackout, and that this much longer 
blackout did not elicit preblackout quiescence. 


Experiment 3c 


In the preceding experiments, blackouts 
interpolated within the interfood interval ap- 
peared to have various facilitative and inhibi- 
tory effects on the behaviors preceding and 
following them. Some of the effects were 
demonstrated to due to trace-discriminative 
control or to the generalization of prerein- 
forcement behaviors to the prestimulus pe- 
riod. Whereas there appeared to be an 
abundance of general activity in these ex- 
periments and a facilitation of activity by an 
interpolated blackout (in Experiment 3a), 
general activity has always decreased before 
food delivery. This seems inconsistent with 
observations that 


incentive-motivational stimuli are sufficient to in- 
crease general activity . . . [this] activity has all 


98 PETER KiLLEEN 


the marks of anticipatory excitement, such as that 
seen during classical appetitive conditioning — 
and that described by ethologists as 'appetitive be- 
haviors. (Bindra, 1968, pp. 3-4) 


The present experiment was conducted in 
an attempt to more clearly demonstrate 
“anticipatory excitement." A flickering 
houselight was used to signal the imminent 
delivery of food. This stimulus was less 
likely to elicit roosting, was long enough to 
permit a measure of activity during its oc- 
currence, and was more closely associated 
with food delivery than were any of the stim- 
uli used in the preceding experiments, 


Method. Two of the pigeons from the previous 
experiments were given 3-sec access to mixed 
Erain every 60 sec. Aíter seven sessions of these 
control conditions, the basic feeding schedule was 
continued for an additional seven sessions, but dur- 
ing the last 12 sec of each interval the houselight 
was pulsed at a rate of 10 Hz. 


Results and discussion. Figure 6 shows 
the experimental (squares and solid line) 
and control (circles and dashed line) data. 
The 12-sec conditioned stimulus inhibited ac- 
tivity immediately before reinforcement, but 
released from inhibition behaviors occurring 
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Ficure 6. Rate of panel activation in 
ment 3c under control conditions (dashed lj 


experimental conditions (solid line). 
began at 0.8. 


Experi- 
ne) and 
The CS 


earlier in the interval. "These somewhat 
paradoxical effects may be understood in 
terms formulated by Staddon and Simmelhag 
(1971) in their analysis of "superstitious" 
behaviors. 

Staddon and Simmelhag monitored the be- 
havior of pigeons who were given 3-sec ac- 
cess to food every 22 sec (cf. Experiment 1b 
of this review). They noted many forms of 
activity during the interfood interval, some 
of which followed time courses similar to 
those found in the present experiment and 
some of which started later in the interval 
and increased in rate as the moment of rein- 
forcement neared. The former behavior they 
termed “interim” responses, the latter “ter- 
minal” responses. Staddon and Simmelhag 
suggested that interim behaviors reflect a set 
of general activities instigated directly by 
reward. It is from this set of activities that 
a few appropriate behavior 
later be selected, via a mechanism such as the 
law of effect. Reinforcement thus has two 
aspects, "variation"— the generation of dif- 


fuse and generalized interim acti 
“selection” 


patterns may 


vities—and 
—the strengthening of particular 
behaviors selected by contiguity to reinforce- 
ment. Staddon and Simmelhag’s analysis 
and evolutionary model are fundamental to 
the behavior studied in this article, An 
analogue may be drawn between the rise of 
interim behaviors and the parameter I and 
between their displacement by terminal be- 
haviors and the parameter C, 
What terminal behaviors 
pected of a pigeon? 
Servation, and other 
either an innate or learned association with 
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analysis is not inconsistent with Bindra's 
(1968) theory of incentive-motivation. The 
terminal behaviors of dogs and rats, Bindra's 
reference subjects (Bindra & Palfai, 1967; 
Zener, 1937), were likely to be vigorous 
coursing or foraging responses that would 
contribute to measures of general activity. 
These and subsequent experiments lend con- 
firmation to Bindra's later (1970, 1972) de- 
velopment of incentive-motivation theory. 

General activity thus appears too gross a 
measure to be useful in comparing levels of 
motivation across species. Indeed, it will not 
do even for a single species: Levels of ac- 
tivity immediately before reinforcement in 
Experiment la were the same for the 400- 
sec and 25-sec conditions, but, by most ac- 
counts, animals in the former condition 
would be judged as more highly aroused. 
This presumed higher level of arousal is re- 
flected in a greater value of A for the shorter 
interval. Furthermore, while there was 
little difference in response rates on the 5-, 
10-, and 15-sec schedules of Experiment 1b, 
A continued to show a proportional increase 
with increases in rate of feeding (A= 
3,000/T, where T is the interfood interval), 
and so provides an index of "arousal" that 
is unbiased by "ceiling effects" on obtained 
response rates. It is possible, therefore, that 
the theoretical asymptote for activity, A, 
will provide a useful reification of the con- 
struct arousal within a species, and, with 
recalibration, may be of value in cross-species 
comparisons. 


Experiment 3d 


By providing a more accurate temporal 
reference point, signals of forthcoming rein- 
forcement conduce a more intense surge of 
either interim or terminal behaviors. In Ex- 
periment 3a, the blackout permitted a brief 
disinhibition of activity followed by a more 
intense suppression. In Experiment 3c, the 
effects were primarily inhibitory, though a 
finer analysis of behavior during the first few 
seconds of the condition stimulus might have 
uncovered some excitation. This last effect 
is somewhat curious, since in Experiment 1b, 
s of 5 and 10 sec showed marked 
f excitation. Judging from those 
there was ample time for the 


interval 
patterns 0. 
conditions, 


animals to move about the box during the 
12-sec conditioned stimulus (CS) in Experi- 
ment 3c. Was inhibition predominant in 
this experiment because the animals had al- 
ready had an opportunity, during the pre-CS 
period, to “displace” their energies? Or is 
the important variable in determining the 
predominance of interim or terminal be- 
haviors the relative, rather than the absolute, 
proximity to reinforcement (cf. Jenkins, 
1970; Staddon, 1972)? In Experiment 1 
the relative proximity of the signal (rein- 
forcement) to reinforcement was zero. Re- 
inforcement was the event most distant from 
the next reinforcement and thus inhibited the 
occurrence of both interim and terminal be- 
haviors. In Experiment 3a the relative 
proximity of the signal (blackout) was .5, 
and both interim and terminal behaviors 
were intensified. In Experiment 3c, relative 
proximity was .8, and only terminal be- 
haviors were observed. In the following ex- 
periment, the relative proximity will be set 
at .8, but the interval will be lengthened to 
300 sec. Will a signal with the same rela- 
tive proximity to reward, but with a much 
smaller absolute proximity, still incite only 
terminal behaviors? 


Method, The subjects had served in Experiment 
lb. Food delivery was scheduled to occur every 
300 sec and was independent of the behavior of 
the pigeons. The houselight was on and was con- 
stant until 60 sec before food delivery, at which 
time it was pulsed at a frequency of 10 Hz. After 
10 sessions, 10 control sessions were conducted in 
which food was delivered every 60 sec with the 
houselight constantly on. Finally, 10 additional 
sessions were conducted during which the house- 
light was off until 60 sec before food delivery and 
then was turned on and pulsed. 


Results and discussion, In Experiment 
3c, the houselight was pulsed during the last 
20% of the interval and during that time 
general activity was minimal. In the present 
experiment the houselight was on during the 
last 20% of the interval, during which time 
a familiar bitonic trend of general activity 
through the interval was observed (see Fig- 
ure 7, squares). An average of 15 panel 
activations occurred during the 60-sec con- 
ditioned stimulus, half of the rate under the 
control condition (dashed line). While the 
response rate during the conditioned stimulus 
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Ficure 7. Rate of panel activation in Experi- 
ment 3d under control conditions (dashed line), 
first experimental condition (houselight on — pulsed; 
light ITI, squares), and second experimental condi- 
tion (houselight off - pulsed; dark ITI, circles). 
was greater than the rate immediately pre- 
ceding the conditoned stimulus, it was lower 
than the average rate of activity during the 
240 sec preceding CS onset (21 responses 
per minute), 

When the houselight was turned off dur- 
ing the intertrial interval, responding was 
completely suppressed until the onset of the 
conditioned stimulus, and then it increased 
to a peak of 28 responses per minute ap- 
proximately halfway through the CS period 
(Figure 7, circles). The Shapes of the 
curves differed, with post blackout respond- 
ing showing the slowest rise to its peak, 


Discussion 


Relatively more activity was elicited by the 
conditioned stimulus in Experiment 3d 
than was the case in Experiment 3c. While 
the conditioned stimulus elicited more 


ity than might otherwise be expected during 
this segment of a 300-sec interfood interval, 
rate did not exceed the average rate through- 
out the rest of the interval. A more dra- 
matic effect was obtained when the chamber 
was darkened during the 240-sec intertrial in- 


activ- 


terval, whereupon rate during CS increased 
substantially. 

Evaluation of the present data in terms of 
average rates would leave the most important 
aspects unseen, A signal of forthcoming 
reinforcement is neither wholly inhibitory nor 
excitatory but has the now familiar multi- 
phasic effect on activity. Whether the signal 
is the shift to a flickering houselight or the 
delivery of food, immediately thereafter activ- 
ity is suppressed. This suppression occurs 
whether the conditioned stimulus is of 12-sec 
or 60-sec duration. An increase in activity 
may occur after the inhibition; such an in- 
crease was noticeable for the 60-sec CS— 
especially when the conditioned stimulus 
was the transition. from houselight-off to 
houselight-on—but might have occurred un- 
measured for the shorter duration CS. In 
all cases there was a decrease in activity im- 
mediately before reinforcement, presumably 
due to the evocation of competing terminal 
activities, 

The present series of experiments demon- 
strate that the course of general activity is 
strongly affected by the presence and nature 
of signals within the interfood interval, 
General activity is not a simple reflexive 
response to food delivery, for it is modified 
by signals of impending food—inhibited by 
signals with close proximity to food, disin- 
hibited and then inhibited more intensely by 
signals with moderate proximity to food. 
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and deprivation levels. To the extent that 
these perturbations have selective effects on 
response systems as modeled by Equation 1, 
they will be reflected in selective changes in 
the parameters of that model. To the extent 
that new processes are introduced by the ex- 
perimental operation, Equation 1 will fail. 


Method. Four White Carneaux pigeons served 
as subjects. They received 3-sec access to mixed 
grain every 60 sec; this schedule was independent 
of any behavior on their part. They were exposed 
to the following conditions in the order listed. 
Sessions were conducted daily, with two adminis- 
trations of each drug, one on Wednesday and one 
on Friday. Control data are from the Tuesdays 
preceding drug administrations and from the days 
preceding the two prefeeding sessions. 

Prefeeding. Each pigeon was given 60 grams of 
grain 1 hour prior to its normal session time. 
This amount equalled 12% of their experimental 
weight. After the first prefeeding session, the 
animals returned to their 80% weight, and addi- 
tional control sessions were conducted before the 
second prefeeding session, 

Caffeine. Each pigeon was given 10 mg of caf- 
feine (approximately 20 mg/kg) 1 hour prior to 
its normal session time. Route of administration 
was oral, 

Accepromazine maleate. Each pigeon was given 
12 mg/kg of the common animal tranquilizer 
acepromazine maleate (Anatran) 1 hour prior to 
its normal session time. Route of administration 
was oral. 

Scopolamine. Each pigeon was given .5 mg/kg 
of scopolamine 15 minutes prior to its normal ses- 
sion time. The scopolamine was injected intra- 
peritoneally. 


Results and Discussion 


The effects of the experimental manipula- 
tions on response rates were generally in the 
expected directions. Prefeeding greatly de- 
creased rate, while caffeine increased rate. 
Both the tranquilizer Anatran and scopol- 
amine brought about slight decreases in rate 
(see Table 3). Inspection of Figure 8 con- 
firms these effects. There also appears to be 
some change in the shapes of the curves, 
which may be analyzed by inspection of the 
parameters. 

Prefeeding the animals, which brought 
about a fourfold decrease in rate, left the 
shape of the curve unchanged from control 
conditions: While A decreased from 74 to 
19, C and T remained unchanged. This in- 
variance is important, for it suggests that de- 


creases in motivation due to satiation, as 
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TABLE 3 


RESPONSE RATE AND PARAMETERS OF 
Equation 1 FOR EXPERIMENT 4 


Re- 
Condition iiss A 6 I 
minute 
Control 48.0 73.9 | 1.61 | .10 
Prefeeding 12.9 19.0 | 1.61 | .10 
Caffeine 51.3 | 149.0 | .74 | 21 
Acepromazine maleate | 40.5 50.7 | 2.81 | .06 
Scopolamine 45.8 62.1 | 2.63 | .10 


might occur within the course of an experi- 
mental session, will have a linear effect on 
performance at all points within the interval. 
Furthermore, differential effects of drugs on 
C and I cannot be explained as a simple ap- 
petite effect, for a substantial prefeeding 
generates no such effects. 

The large dose of caffeine brought about 
only a small increase in rate of panel activa- 
tion. Inspection of Table 3 and Figure 8 
shows, however, that there were large effects 
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Ficure 8. Rate of panel activation in Experi- 
ment 4b. (For clarity of presentation, the top 
curve, caffeine, has been displaced upward by 20 
responses per minute, and the second curve, con- 
trol, has been displaced upward by 10 responses 


per minute.) 
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on behavior that are not reflected in an aver- 
age rate measure. The asymptotic rate, A, 
doubled, while the rate of approach to asymp- 
tote decreased. The activity then subsided 
at a faster rate than under the other condi- 
tions of this experiment, with C decreasing 
to less than half of the control value. 

The changes under caffeine may be in- 
terpreted as an intensification both of general 
activity and of the terminal behaviors which 
compete with general activity toward the end 
of the interval. The slower rate of approach 
to asymptote might be interpreted as an in- 
tensification of “initial” behaviors which 
compete with interim behaviors at the begin- 
ning of the interval. But there were no 
obvious initial behaviors apparent during 
casual observations of the pigeons under vari- 
ous conditions of these experiments. _ The 
resonant frequency under caffeine, 1//CI = 
2.52, was close to that of the control condi- 
tion, 2.49. 

The effects of acepromazine maleate were 
opposite to those of caffeine. Asymptote de- 
creased, rate of approach to asymptote in- 
creased, and rate of recovery decreased. 
Resonant frequency remained close to the 
control value, at 2.53. 

The effects of scopolamine were curious, 
There was a slight decrease both in average 
response rate and in asymptote. The rate 
of approach to asymptote remained at the 
control value, while the rate of recovery de- 
creased. This asymmetric change in the 
parameters caused the resonant frequency to 
decrease to 1.95. 

Scopolamine is known to cause a decrease 
in habituation to novel stimuli and a failure 
to inhibit incorrect responses in learning 
situations (cf. Carlton, 1969). The increase 
in C reflects a failure to inhibit general ac- 
tivity at a time when terminal behaviors 
might be more appropriate. The lack of 
effect on I indicates that scopolamine has 
no effect on the excitation of general activity 
but only on the excitation of specific terminal 
behaviors which compete with general ac- 
tivity. 

The behavioral changes brought about in 
this experiment are consistent with current 
intuitions concerning the nature of the ex. 
perimental manipulations, and thus they rein- 
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force both those intuitions and the use of the 
model for their quantitative assessment. The 
dissociation of changes in C and I effected 
by scopolamine may reflect different neuro- 
chemical bases for induced (interim) and 
operant (terminal) behaviors (cf. Margules 
& Margules, 1973). The invariances thaf 
were found—the constancy of the parameters 
under prefeeding and the constancy of the 
resonant frequency under all but the scopol- 
amine conditions—challenge the various in- 
terpretations of the model. These issues will 
be examined later in the General Discussion, 


ADJUNCTIVE BEHAVIORS 
Experiment 5a 


In training animals to perform for vaude- 
ville shows, Keller and Marion Breland 
(1961) noted that the form of trained oper- 
ant behaviors would often drift toward that 
of more species-typical instinctive behaviors 
and that instinctive behaviors would intrude 
into operant chains. A prototypical example 
may be found in sideshows today when one 
commands a chicken to “play a piano' 


serting a quarter into its “modified 
box." 


' by in- 
Skinner 
The performance is frequently inter- 
rupted by vigorous scratching of the floor of 
the box, much as if the animal were scratch- 
ing for seeds in loose dirt. This behavior 
was analyzed in the following experiment. 
Method, 


humbswitch, 
«l sec. 


Results. All chickens spent 
amount of time scratching the flog 
front wall of the chamber, Man ot rs 
Scratches were directed at the ho 4 i = 
ture. Figure 9 shows the tru aa, M 
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Ficure 9. Rate of scratching (seconds per min- 
ute) induced by reinforcement in Experiment 3a. 
(The curve was derived from Equation 10.) 


to a maximum about halfway through the 
interval and decreased thereafter. Equation 
1 accounted for 95% of the variance in the 
data, while Equation 10, which is represented 
in Figure 9, accounted for 98.6%. 


Experiment 5b 


Another set of behaviors induced by inter- 
mittant feeding has been labeled “schedule- 
induced aggression” (Azrin, Hutchinson, & 
Hake, 1966; Flory, 19692). This type of 
adjunctive behavior was studied in the fol- 
lowing experiment. 


Method. Hungry pigeons were placed in the 
chamber, each with an undeprived target pigeon. 
The experimental and target pigeons were sep- 
arated by a clear plastic partition 15 cm high, 
placed diagonally in the chamber. The experi- 
mental subjects were fed every 60 sec, independ- 
ently of their behavior. Pecks directed at the 
target bird were recorded by depressing a thumb- 
switch, which incremented the data bins every 


.1 sec. 

Results and discussion. During the last 
five sessions, one pigeon spent 9 sec out of 
every minute attacking the target pigeon, 
while the other spent 53 sec per minute in 
attack, Equation ] was fit to the first seven 
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data points and accounted for 97.7% of their 
variance (see Figure 10). 

Schedule-induced aggression follows a 
time course similar to general activity (con- 
trol data from Experiment la for these two 
pigeons appears as a dashed line in Figure 
10), both behaviors sustaining representa- 
tion as the sum of two exponential functions. 
Just as general activity is suppressed by the 
proximity to reinforcement, so too is aggres- 
sion. There was, however, a self-perpetuat- 
ing aspect to the aggressive behavior, with 
animals often becoming "locked into" a bat- 
tle, and in some cases missing food. This 
perseveration is evident in Figure 10, where 
aggression during the last 15 sec of the in- 
terval is seen to occur at a rate higher than 
predicted by the model. A less reactive 
target, such as a contact-sensitive mirror 
(Cohen & Looney, 1973) might provide a 
better preparation when the transactional 
aspects of behavior must be minimized. 

Both schedule-induced and extinction-in- 
duced aggression are manifestations of nor- 
mally occurring behaviors whose probability 
has been greatly increased by a free-food 
schedule. While such scheduling is main- 
tained, the temporal pattern of aggression is 
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Ficure 10. Rate of attack (seconds per min- 
ute) induced by reinforcement in Experiment 


5b. (Also shown is the rate of panel activa- 
tion for these two pigeons in Experiment la, dashed 
line; scale reduced by .75. The curves were derived 
from Equation 1.) 
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Ficure 11. 


Rate of lever pressing for water in 


Experiment 5c. (The curves were derived from 


Equation 11.) 


largely modulated by the proximity to rein- 
forcement and, when present, discriminitive 
stimuli signaling that proximity (Cole & 
Litchfield, 1969; Flory, 1969b). In extinc- 
tion these cues are absent, and both interim 
and terminal behaviors will blend and ex- 
haust themselves, possibly according to a 
simple exponential decay function (cf. Azrin 


et al., 1966; Clark, 1959, 1964). 
Experiment 5c 


When a rat is exposed to periodic feeding 
schedules it will consume large quantities of 
water in the interval between feedings (Falk, 
1972). Although similar patterns of exces- 
sive drinking have been noted in the rhesus 
monkey (Schuster & Woods, 1966) and 
pigeon (Shanab & Peterson, 1969) the in- 
take of a rat is most copious, approaching 
half of the animal's body weight in a 3-hour 
session. Explanations in terms of adventi- 
tious reinforcement and mediational func- 
tions have been ruled out (Falk, 1969). To 
determine whether the time course of sched- 
ule-induced polydipsia corresponds to that of 
the interim and terminal behaviors investi- 
gated in the previous experiments, rats were 


exposed to various feedings schedules with 
water available contingent on a lever press. 
In a later phase of the experiment, a wooden 
block was suspended in the cage, and rate of 
gnawing was measured. 


Method. Four naive albino rats, approximately 
100 days old, were exposed to feeding schedules of 
30-, 75-, 150-, and 300-sec according to a Latin 
square design. Each depression of a lever de- 
livered .08 ml of water to a reservior near the 
food cup; food was not delivered until 5 sec had 
elapsed without a lever press. After 15 sessions 
at each of the four feeding intervals, the lever was 
removed and a small block of pine wood Was sus- 
pended from a microswitch above „the ceiling. 
Manipulation of the wood sent a train of 10 Hz 
pulses to the counters for the duration of the ac- 
tivity. With the wood available, the food pellets 
were delivered every 75 sec, but not within 5 sec 
of any contact with the wood, After 12 sessions 
the water was reintroduced, and another 10 ses- 


sions were conducted, 
Results and discussion. 
polydipsia was observed; under all but the 
longest interfood intervals, the rats con- 
sumed several times over their normal daily 
intake of water. One of the four rats showed 
regularly increasing response rates up until 
the moment of reinforcement, despite the 
DRO contingency employed to minimize 
Superstitious behavior (cf. Segal, 1969). 
Its data are deleted from further analyses. 
The behavior of the other three rats is well 


represented by the averages shown in Figure 


11. Overall rate of lever pressing for water 


Was greatest for short intervals and de- 
creased as the interval was lengthened (see 


Schedule-induced 


TABLE 4 


EXPERIMENTAL CONDITIONS, AVERAGE REsp 


ONSE 
RATES, AND PARAMETER VALUES FOR 
EQUATION 11 IN EXPERIMENT 5c 
Conditi a | We tae 
onditio; Sponse 
ition spon: H S |M 
Feeding condition 
30 sec 9.5 | 14.8 | 26 56 
75 sec 6.1 8.3 | .29 | .50 
150 sec 4.3 5.1 | .33 | .63 
300 sec s] 3 | 75 | 51 
Drinking (wood present) | 3.3 TA T27 36 
Gnawing (water absent) 2.2 4.2 30 E 
nawing (water present) 29 13 | 23 50 


Note. Response rates are measured a 
minute or seconds gnawin, 


3 lever press 
i per minute, Presses per 
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Table 4). Lever pressing was maximal mid- 
way through the interval, with the variance 
around .55 showing no systematic relation to 
the interpellet interval. The curves through 
the data points of Figure 11 are normal 
(Equation 11); values of the parameters 
are listed in Table 4. 

When the pine block was introduced, a 
considerable amount of gnawing took place. 
Its time course was similar to that of sched- 
ule-induced polydipsia. When the water was 
reintroduced, gnawing decreased substan- 
tially, with the maximum rate occurring 
earlier in the interval. Rate of drinking in 
the presence of the wood was depressed, with 
the maximum rate occurring later in the 
interval. 

Schedule-induced polydipsia is often char- 
acterized as a “postpellet phenomenon." 
The present experiments belie that descrip- 
tion and show that, while there is some post- 
pellet drinking, the maximum rate of drink- 
ing occurs halfway through the interpellet 
interval. When a block of pine wood is 
available to the rats, they divert some of the 
time that would have been occupied drinking 
water to manipulation and chewing of the 
wood, with wood gnawing tending to occur 
slightly earlier in the interval than water 
drinking. 

Both schedule-induced polydipsia and 
gnawing are reasonably well described by 
normal curves which account for approxi- 
mately 94% of the data variance. As rate 
of reinforcement increased, the maximum of 
the curves (H) increased, the standard de- 
viation (S//2) decreased, and the midpoint 
(M) remained unchanged. 

This was the first time Equation 11 was 
invoked to describe data. As the number of 
steps in a series latency mechanism increases, 
the resultant probability density function ap- 
proaches the normal. This follows from the 
central limit theorem. By the same taken, 
the normal distribution is the limiting form 
of many different types of mechanisms which 
might be hypothesized to underlie the ob- 
tained distributions. While the obtained 
data might have modeled with four- and 
five-step general gamma processes, there 


would be little point to the exercise, for so 


TABLE 5 


FIXED-INTERVAL AND PARAMETER VALUES FOR 
EQUATION 11 For DATA FROM 
Various SOURCES 


Fixed i 
Source teal H S M 
(sec) 
Hawkes and Shimp 
(1975) 5 | 180.0 | .566 | .83 
Catania and Reynolds 30 | 106.0 | .488 | 1.21 
(1968) 50 83.3 | .488 | 1.06 
200 46.5 | 488 | .81 
Experiment 2a 60 89.4 | .406 | .92 
Dukich and Lee (1973) 60 55.0 | .347 | 1.05 
120 55.0 | .428 | 1.24 


many parameters would dilute conviction in 
any interpretation of that model. 

Why is the normal distribution, rather 
than a two- or three-step general gamma dis- 
tribution, appropriate for these data? I sus- 
pect the answer lies in the specificity of the 
measured response. Where it is quite gen- 
eral, as was the case for the experiments re- 
ported in the first part of this review, early 
steps in the gamma process are likely to be 
captured as measured responses. While 
subsequent responses may be chained to those 
early responses and appear normally dis- 
tributed, it is the earliest behaviors, with the 
shortest time constants, that contribute most 
heavily to the final shape of the distribution. 

As the set of measured behaviors becomes 
more specific, we should expect a sharpening 
of the distribution. If the behaviors occur 
early in the interval, they should be unalloyed 
two- or three-step gamma processes. If 
they occur late in the interval, with many in- 
tervening steps in the chain, they should be 
best described as normal distributions. In 
the remainder of this article we examine 
specific responses occurring late in the in- 
terval and determine the shape of the dis- 
tribution which best describes them. 


APPLICATION OF THE MODEL TO OPERANT 
AND RESPONDENT BEHAVIORS 

It was noted that one of the four rats in 

the previous experiment behaved atypically, 

ingesting water at an increasing rate through 

the interval. Other investigators have noticed 

that “schedule-induced” drinking may shift 
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Ficure 12. Rate of ke: 


y pecking by pigeons on 
various fixed-interval schedules. (The top curve, 


fixed interval 20 sec, is displaced upward by 40 
responses per minute, while the middle curve, fixed 
interval 50 sec, is displaced upward by 20 responses 
per minute. From Catania & Reynolds, 1968.) 


toward the end of the interval and take on 
the "characteristic form of a fixed-interva] 
scallop.” Segal (1972) has suggested that 
this shift may be characteristic of the genesis 
of many operant performances— "induction" 
of some response as an interim behavior 
consequent increase in the probability of c 
tiguity with an ensuing reinforcer, and even- 
tual entrapment as a reinforced, operant re- 
sponse. This shift in the contro 
tingencies should be reflected in a 
mode and shape of the response d 
In the previous section we descri 
tribution of induced behaviors w 
curve. What is the nature of 
tion of the operant behaviors: What is the 
mathematical form of a “typical fixed-in- 
terval scallop"? In the following para- 
graphs we select from the literature data on 
which to base an answer. 

In a monograph published in 1968, Catania 
and Reynolds reported an extensive study of 
responding by pigeons on variable-interya] 
and fixed-interval schedules, 


i Figure 12 
shows the rate of responding as a function 


»a 
on- 


lling con- 
shift in the 
istribution, 
bed the dis- 
ith a normal 
the distribu- 


of time through the interval, for fixed in- 
tervals of 30, 50, and 200 sec. The curves 
through the points are normal with param- 
eters given in Table 5. Shown in Figure 13 
are data from Hawkes and Shimp (1975) 
for pigeons responding on an fixed-interval 
5-sec schedule. Although these data were 
collected by different investigators and were 
generated on a very short fixed.interval 
Schedule, the same normal function. which 
fit Catania and Reynolds's (1968) data is 
also appropriate for Hawkes and Shimp. 
The performance of rats on fixed-interval 
schedules is similar to that of pigeons. Fig- 
ure 14 shows data collected by Dukich and 
Lee (1973) studying the responding of rats 
on fixed intervals of 60 and 120 sec. The 
parameters are listed in Table 5. 

In all cases, behavior is well represented 
by the left tail of a normal curve, which ac- 
counts for 99.5% of the data variance, The 
present analysis suggests that the form of the 
typical fixed-interval scallop—the smoothly 
accelerating curve drawn by a cumulative 
recorder—is a cumulative normal function, 
We now turn our attention to the param- 
eters which describe that function, 
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In Figure 15 I have plotted the product of 
M and T as a function of T. M may be 
taken as a psychological judgment of the 
duration of the interval, and multiplication 
by T returns it to a nonnormalized form. 
The data from Experiment 2a and from 
Catania and Reynolds (1968) fall along a 
straight line with a slope of .8. The data 
from Hawkes and Shimp (1975) fall be- 
low that line.* On the strength of the former 
data, we suggest that the psychophysical scale 
of duration for pigeons is a power function 
with an exponent of .8. Catania (1970) 
came to the same conclusion from an analy- 
sis of behavior on other reinforcement sched- 
ules, as did DeCasper and Zeiler (1974). 
The fractional slope of the function indicates 
that pigeons typically overestimate short in- 
tervals and underestimate long ones. "This 
tendency is reflected in the truncation of the 
behavior during a period of accelerating re- 
sponding on short fixed-interval schedules, 
and during a period of decelerating respond- 
ing on longer schedules. For the present 


? The differences which exist, such as the larger 
value of S, the smaller value of M, and the larger 
intercept at / — 0, may be due in part to the use of 
a 2,9 sec time-out after reinforcement. 
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Ficure 15. Top line: The theoretical maximum 
number of responses induced by food as a function 
of the interfood interval. (Data are from Catania 
& Reynolds, circles; from Experiment 2a, squares; 
and from Hawkes & Shimp, triangles. The slope 
of the line is .6.) Bottom line: The relation be- 
tween interfood interval and midpoint of the normal 
curve (M). (The data from Catania & Reynolds, 
circles, and from Experiment 2a, squares, fall along 
a straight line with a slope of .8, while the data 
from Hawkes & Shimp, triangles, fall below that 
line.) 


data, estimation is most accurate at a fixed- 
interval value of about one minute, where 
reinforcement interrupts pigeons at their 
maximal rate, 

The parameters for the rats are not shown 
in Figure 15. Their behavior was markedly 
different from that of pigeons—the maximum 
rate, H, remained the same for FI 60 and FI 
120, while M increased, rather than de- 
creased with T. Since these data are from 
the control conditions of but one study, they 
should not be weighed too heavily. If they 
are replicated, these qualitative differences 
may implicate an important distinction drawn 
by Seligman (1970) between "prepared" re- 
Sponses such as pecking, induced by and 
closely bound to reinforcement, and "unpre- 
pared" responses, such as lever pressing, 
that are more loosely coupled to reinforce- 
ment. 

The existential status of the right, descend- 
ing limbs of the truncated normal curves is 
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uncertain. When reinforcement is only oc- 
casionally withheld on fixed-interval sched- 
ules, rate does not usually drop off as quickly 
as would be predicted by the present formula- 
tion (cf. Staddon & Innis, 1969; Zimmer- 
man, 1971). A higher ordered exponential 
equation would provide the necessary "asym- 
metrical normal curve" shape. Unfortunately 
the necessary data have not been collected to 
evaluate such a model. When reinforcement 
is withheld on a more regular basis (e.g., 
Catania, 1970; Myerson, 1972), response 
rate does drop off in a symmetric, normal 
fashion. Equation 11 therefore seems ap- 
propriate for well-learned behaviors such as 
fixed-interval responding and discriminated 
extinction. Other models, such as a simple 
exponential decay function, may apply to 
nondiscriminated extinction (Clark, 1959, 
1964; Furumoto, 1967). 

'To calculate the maximum number of re- 
sponses evoked by a reinforcer, one inte- 
grates the normal curve from — ~ to + co 
and finds it to be, quite simply, HSz. In 
Figure 15 I have plotted HS as a function 
of T. The equation 


HSr = kT’, [12] 


whose locus appears as a solid line, repre- 
sents well Catania and Reynolds’ (1968) 
data (circles), Experiment 2a data (square), 
and Hawkes and Shimps’ (1975) data 
(triangle). It should also be mentioned that 
the unnormalized value of S is approximately 
proportional to T, and H is a power function 
of T with an exponent of —.4. It is not yet 
clear which of these relations will turn out 
to be fundamental, and which derivative, 
To predict the average obtained response 
rate on fixed-interval schedules, one must 
multiply HSz by the area under the curve 
between ¢ = 0 and £ = 1 and then divide by 
T. Figure 15 shows us, however, that the 
normal curve was centered around 1 only for 
fixed intervals of approximately 60 sec: For 
shorter schedules, the animals overestimated 
the time, thereby truncating the normal 
curve before maximum rate was attained. 
As the interval grows smaller, less and less 
of the maxium possible rate is attained. This 
attenuation might be termed “inhibition by 
reinforcement” (cf. Gormezano & Moore 
1969; Hovland, 1936). The mechanism of 
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such “inhibition” is simply the nonlincar 
psychophysical scale of duration demon- 
strated in Figure 15. 

The present model throws additional light 
on some regularities in behavior noted by 
Nevin (1974). Nevin measured response 
rates on two alternating variable-interval 
schedules in the presence of explicit compet- 
ing responses (a fixed-interval schedule on 
a second key). Over a range of different 
reinforcement probabilities for the variable 
interval schedules, he found 


Ri =q Rë, [13] 


rate on one schedule was a power function 
of rate on the other. Rates on the variable- 
interval schedules were greatest at the begin- 
ning of the fixed interval and decreased as 
the interval progressed and as fixed-interval 
responding became predominant. Their time 
course looked much like that of interim be- 


haviors (Figure 1); the analogy has some 
face validity and will be examined. 


Let us simplify Equation 1 by ignoring the 
inhibition of reinforcement : 


R; = Aje-'/Ci, [14] 


Response rates on each of the two variable- 
interval schedules should approximate ex- 
ponential decay functions of time through 
the interval. To determine the relation be- 
tween response rates, we may compare rates 
only at the same point in time and thus 
collapse Equation 14 over 1: 
-CiIn(R;/A,) = 

t = —Cs In(Re/Ag), 
Rearrangement yields 


Ri = a(R 


[15] 


[16] 


Ag 
a form equivalent to Equation 13, with 
s= O/C, nz] 
and 
1= Ai/ Ag, [18] 
While N 


lows Tam d (1974) Equation 13 thus fol- 
quantitati * present theory, independent 
Ris peni Predictions of his data are diffi- 
_ i ip the present analysis provides 

asis tor predicting the nature of changes 
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in C with changes in the probability of rein- 
forcement for interim behaviors (ie., vari- 
able-interval schedule responding). If we 
assume, however, that A is proportional to 
the rate of reinforcement, we may use 
Nevin's values of s in Equation 18 to pre- 
dict his obtained values of g. The unknown 
constant of proportionality enters Equation 
18 in a nonlinear fashion, causing the pre- 
diction for individual animals to be in error 
by the factor k“, Nonetheless, approx- 
imate prediction is possible, and the scatter- 
gram of obtained versus predicted intercepts 
(Figure 16) shows it to be effective. 

The relevance of the present analysis to 
Nevin's data is important in light of the as- 
sumptions that were made in the course of 
the analysis. Key pecking on multiple vari- 
able-interval schedules, maintained by contin- 
gent reinforcement, was successfully treated 
as an "interim" behavior. Such responding 
shares with interim behaviors the same rela- 
tively homogenous probability of reinforce- 
ment over time, and, like interim behaviors, 
suffers competition from periodically occur- 
ring reinforcers. The use of Equations 1 
and 10 should not, therefore, be thought to 
be restricted to schedule-induced, or interim 
behaviors but may be taken as a model of the 
changes in any tonic, or background, be- 
havior in the context of periodic occurrences 
of reinforcers which elicit competing be- 
havior. 


Application to Respondent Behavior 


While the paradigms which generated 
much of the data in the present experiments 
would be classified as respondent, or classical 
conditioning, it is useful to look at a more 
traditional instance of respondent behavior 
from the perspective of the present theory. 
Pavlov (1960; pp. 95, 96) reported several 
experiments involving "inhibition of delay" ; 
conditioned stimuli of 3-min duration were 
presented at intervals of 12 min. The un- 
conditioned stimulus was the injection of a 
weak acid solution into a dog's mouth. Al- 
me of the trials involved compound 
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Ficure 16. Obtained versus predicted values of 
q, the intercepts of Nevin's (1974) power functions. 
(The predictions followed from Equation 18, with 
A assumed proportional, k = 150, to the reinforce- 
ments per minute. The predictions are for indi- 
vidual subjects from each condition, with the ex- 
ception of Pigeon 60 Condition 4, for whom no 
slope could be determined.) 


with a mean of .95 and a standard deviation 
of 43 accounts for practically all of the data 
variance. 

Equations 10 and 11 are also reasonable 
descriptors of the behavior occurring during 
much shorter conditioned stimulus-uncon- 
ditioned stimulus intervals. Smith (1968) 
classically conditioned the nictitating-mem- 
brane response of rabbits, employing in- 
tervals between 125 and 1,000 msec. Third- 
order exponential functions described the 
data from the three shortest intervals and 
a normal curve described the data from the 
longest interval. Approximately 95% of the 
data variance was accounted for by those 
models.* 


GENERAL DISCUSSION 


The study of the general activity of pigeons 
during the interval between feedings gener- 
ated regular data conducive to representation 


3 An additional transformation was necessary to 
permit these models to fit Smith's data: 90 msec 
were arbitrarily subtracted from the abcissa before 
Equations 10 and 11 were fit. This latency should 
be treated as an additional free parameter and 
might be interpreted as an estimate of the absolute 
refractory period. 
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Ficure 17, Rate of salivation (drops per min- 


ute) as a function if time within a 3 min condi- 
tioned stimulus (data from Pavlov, 1960). 


by a simple mathematical model (Equation 
1). The two exponential decay functions of 
the model were identified as a short-term 
inhibitory process reflecting the remoteness 
of one food delivery from the next and a 
longer term process reflecting the rise of 
competing ("terminal") behaviors, 
Whereas this simple model provided an 
adequate account of general activity, the 
addition of a third exponential was necessary 
to accommodate the interim behaviors re- 
ported by Staddon and Simmelhag (1971), 
and it improved the fit to some other data. 
This third, very fast exponential permitted 
the early portions of the curves to show posi- 
tive acceleration and was attributed to a 
carry-over of consummatory responses or 
orienting responses into the beginning of 
short intervals. Although responses Occur- 
ring toward the middle and end of an in- 
terval could be represented by three or four 
interacting exponentials, the normal function 
required fewer parameters and provided 4 
superior fit to the data. The normal curve 
is the limiting distribution when many ex. 
ponential decay functions interact, 


Changes in the parameters of the model 
showed a gratifying concordance with changes 
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in independent variables. The theoretical 
maximum, A, was proportional to the feed- 
ing rate, while H was a power function of 
the feeding rate. The time constants of the 
component exponential processes showed 
regular changes with changes in rate of 
feeding and a consistent (inverse) relation- 
ship with each other. The parameters of 
the normal curve were cither relatively in- 
variant or changed smoothly with changes 
in the interfood interval. 

Equations 1 and 10 may be interpreted as 
the impulse response curves of second and 
third order control systems. In this con- 
struction, response rate is presumed propor- 
tional to the deviation of the system from 
homeostasis, This deviation is brought 
about by an impulse—in the present case, 
food delivery. It may be assumed either 
that the measured response drives the system 
back to equilibrium (a traditional Servo-sys- 
tem with proportional negative feedback) or 


simply that the return occurs as an exponen- 
tial function of time, 
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eters! Further experiments should be con- 
ducted before a cybernetic interpretation is 
discarded. But one of the most severe im- 
pediments to this construction is its inter- 
pretation of a low response rate before food 
delivery as a return to equilibrium. If that 
feeding were omitted, response rate would 
soon increase to record highs, a phenomenon 
beyond the purview of simple control theory. 
«Although such a rebound might be accom- 
modated by control systems with predictive 
elements, the complexity of those models is 
not justifiable with the present limited data 
base. 

An unramified stochastic process, such as 
was presented earlier, fares little better than 
control theory as an interpretation of the 
fundamental equations. The constancy of 
the number of inferred parallel processes in 
Experiments la and 1b was encouraging. 
It is possible that simple modification of the 
organism's environment, such as a change in 
chamber size, will have direct effects on the 
index. It is possible that the steps in the 
chain will be correlated with the perception 
of and reaction to environmental stimuli and 
thus tie the present model into stimulus- 
sampling theories of behavior acquisition 
(Estes, 1959; Kintsch, 1970). 

But there are problems with the simple 
stochastic interpretation. The number of 
hypothetical processes was constant in no 
other parts of these experiments, and the 

` changes that occurred under drugs were 
unenlightening. Especially damaging is the 
constancy of the product of C and I over 
large changes in feeding rate and insult by 
various drugs. This powerful invariance 
in the data would go unnoticed for there is 
no index in series latency mechanics which 
involves the product of C and I. Finally, 
the construal of individual responses from 
homogenous sets of behavior, such as pacing 
or drinking, as independent outcomes of 
parallel processes seems unsympathetic to 
the data. 

General activity comprises many behaviors 
"which hang together as unitary phenomena. 
The stochastic model may be modified to pre- 

serve this molar structure of the data. As- 
sume the gamma distribution represents a 
"behavioral state of the organism during 
which time à particular activity, such as 


DET 


polydipsic drinking, occurs at a uniform rate. 
lt is called a "behavioral" state to avoid 
connotations of a “drive” state, such as 
"thirst." Let us assume the presence of an- 
other state, represented by a gamma dis- 
tribution with the same values of A and I 
but a different value of C (C’). Correlated 
with this state are other behaviors, such as 
running in a wheel, which are suppressed 
(“displaced”) by the behaviors of the first 
state. The probability of being in the second 
state at a given point in time (and therefore 
the average rate at that time) will be equal 
to the difference of the two equations: 


R -— A(et/c — eon, [19] 


This behavioral state development seems 
most in accord with observed behaviors. 
Furthermore, it explains quantitative prop- 
erties of data collected by Staddon and 
Ayres (in press) and analysed by them and 
by Staddon (in press): in a state theoretic 
manner. Their analysis, which motivated 
the above development, showed that the 
duration of one activity (e.g, running) 
was negatively correlated with the time of 
offset of a previous activity (e.g., drink- 
ing). They took the significant negative 
correlations, along with data from experi- 
ments in which one of the activities was 
prevented, as evidence against a simple 
chaining interpretation (i.e., drinking leads 
to running) and for central timing of each 
of the states. Inspection of Equation 19 
shows that the total amount of time in a 
state will indeed be negatively correlated 
with variations in the time constant govern- 
ing entry into that state (C'). The machin- 
ery of generalized gamma processes may 
thus be retained, but the emphasis upon the 
termination of a state is now shifted to the 
behavior occurring during a state. The 
“latent period” is revealed by the measured 
responses, which are taken to signify the out- 
come of competition among parallel be- 
havioral states, 

There are implications to the present re- 
search that go beyond the provision of an 
adequate model for the time course of be- 
havior and the leverage on related problems 
which that provides, Four historically dis- 
tinct phenomena—general activity, schedule- 
induced behaviors, operants, and respondents 


2 


-have been encompassed by a single mathe- 
iatical formulation. While there are quali- 
tive distinctions among these various be- 
aviors, they have a common foundation in 

\quations 1, 10, and 11. If future research 
hows that multiple behaviors within an in- 
erval share some parameters (e.g., A, cor- 
ected for response units) and exert predict- 
ble effects on the parameters of other be- 
viors (as the introduction of an operant 
n Experiment 2a increased the value of C), 
he theoretical usefulness of those distinc- 
ions will fade. 

What then will remain as important dis- 
riminanda among behaviors? Operants are 
aid to be emitted while respondents are 
licited. But "emission" cannot be taken to 
mply lack of prior causal stimuli (Staddon, 
973); all operants must be "elicited" by 
stimuli of some sort. If an external dis- 
criminative stimulus is not provided, time 
will serve. And time serves operants and 
respondents similarly, for we have seen that 
the same mathematical function of time de- 
scribes both types of behavior. 

Respondents are said to extinguish more 
quickly than operants and to suffer greater 
decrement under conditions of partial rein- 
forcement. But that distinction is con- 
founded by the historic difference between 
the response systems studied within the op- 
erant and respondent paradigms. When 
similar systems are studied, similar partial- 
reinforcement effects may be found (eg, 
Williams & Williams, 1969). In the experi- 
ments modeled in this article, many key 
pecks and many drops of saliva occurred 
during the intervals between feedings, Which 
was on a thinner schedule of partial rein- 
forcement is moot. The same equation de- 
scribes the time course of each and provides 
theoretical asymptotes for each. Only when 
the scale factors for various behaviors re- 
ceive relative calibration (e.g., by the mea- 
surement of concommitant changes under ma- 
nipulations of drugs, deprivation, or amount 
of reinforcement) can meaningful compari- 
sons be made. And only then can the ap- 
parent differences in resistance to extinction 
in díscrete-trlal experiments be fully under- 
stood. 

Some behaviors are more malleable than 
others; typical respondents are not so readily 
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shaped with differential reinforcement and 
might therefore justify distinction from op- 
erants. But the Brelands (1966) have 
alerted us to the limitations on our ability 
to shape operant behaviors. Moore (1973) 
suggests that the topography of operants is 
closely related to forms of behavior induced by 
"reinforcement" ; contingencies merely change 
the location of induced behaviors. Hearst and 
Jenkins (1975) provide a possible mech- 
anism for that spatial control in their analysis 
of "sign-tracking" (cf. Neuringer & Neu- 
ringer, 1974; Schneirla, 1959). Certainly 
more research is necessary, but the presently 
available data suggest that sign-tracking, 
schedule-induced arousal, and the present 
model of temporal control may provide the 
basic "laws" for a theory of behavior more 
powerful than any founded on the law of 
effect. 

The asymptotic level of general activity, 
schedule-induced behaviors, and operants in- 
creases with increases in the frequency of 
reinforcement. This systematic change is 
taken as evidence for an intervening state of 
“arousal.” The construct has much in com- 
mon with the "anitcipation-invigoration 
mechanism" of Cofer and Appley (1967), 
rer "s (1972), the “central- 

a indra (1972), and the 


“arousal” of Wayner (1970), Valenstein 
Cox, and Kakolewski (1970), Glickman 
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the present theory distinguishes “normal,” 
“operant,” or “terminal” behaviors from “in- 
terim,” “adjunctive,” or “displacement” be- 
haviors? Two aspects of the incentive- 
motivator will account for the distinction : 
(a) The incentive is available only at the 
end of the interval; behavior appropriate to 
it will therefore be elicited toward the end 
of the interval, and only then; and (b) Ade- 
quate contact with the incentive will satiate 
the organism, thereby turning off the 
arousal.* 

Elements of the present theory have been 
suggested before (cf. Bindra, 1972, Staddon, 
1972, on temporal versus situational control ; 
Moore, 1973, on Pavlovian mechanisms in 
operant behavior; Fentress, 1973, on the 
nonspecificity of the central motive state of 
arousal). They may seem too simple and 
incomplete to account for the data under 
purview; but consider: The theory suggests 
that a state of elevated arousal is generated 
by events traditionally referred to as rein- 
forcers. The arousal increases the vigor of 
all responses elicited by situational stimuli, 
both interoceptive and exteroceptive, acting 
in concert with temporal stimuli. The elicita- 
tion of behavior is quite general, with those 
behaviors favored by the organism’s phylo- 
genetic or ontogenetic history and present 
drive condition being predominant. Since 
many of these behaviors do not abate the or- 
ganism’s arousal, they may continue at “ex- 
cessive” levels. But as time wears on, be- 
haviors appropriate to the appeted stimulus 
will increase in frequency according to equa- 
tions such as 10 and 11. All of the “fine 
control” of behavior attributed to law of 
effect strengthening may be reducible to the 
time-contingent elicitation of approach to 
stimuli that are correlated with reinforce- 
ment. 

The present theory repudiates the tradi- 
tional distinctions between classes of be- 
havior and questions the future usefulness of 
the law of effect. That law was historically 
necessary, for it established “reinforcement” 
as a firm temporal reference point from 
hich to view the immediately preceding 


w = - 
behaviors. With the present 
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4This feature is missing when the reinforcer is 
intracranial stimulation of the brain. 
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model of temporal control, we may trace the 
course of all behaviors that are referenced to 
signals of forthcoming incentives, such as 
feedings. “Reinforcement” and its retroflex 
action may be dispatched. 
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In the mid 1960s, R. J. Douglas and K. a 
Pribram presented a series of experiments 
detailing the effects of hippocampal lesions 
on problem-solving behavior. The results of 


these experiments were accounted for by re- 
course to the concept of experiencing a 
change in awareness which was interpreted 
as indicating the involvement of “attention.” 
A continuing interest in problems of atten- 
tion has been reflected not only in the work 
reported from our laboratory (a dozen anda 
half papers) but also in that of many others 
(see reviews by Mostofsky, 1970; Horn and 
Hinde, 1970; and Kahneman, 1973). Re. 
search has largely been directed to two gen- 
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arousal have been studied extensively and 
have been labeled by Berlyne (1969) as col- 
lative variables. These include sudden 
changes in intensity to which the organism 
is unaccustomed, changes in the timing of 
inputs, and changes in the ground in which 
a stimulus figure appears, in particular when 
input is scarce, surprising, complex, and 
novel. Such collative characteristics also 
define the concept "information" as it is used 
in the study of communication systems (see 
e.g., Brillouin, 1962) ; thus it is possible to 
treat organisms subject to arousal as “in- 
formation processing systems," where input 
is matched against some residual in the or- 
ganism of his past experience, or some com- 
petence (Miller, Galanter, & Pribram, 1960; 
Pribram, 1971). Without such matching, 
there could be no novelty or information nor 
even a measure of change in intensity. 

This knowledge has led to behavioral re- 
search which has been addressed to such is- 
sues as adaptation level (Helson, 1964), 
expectancy (Bruner, 1957) and the develop- 
ment of “neuronal models” (Sokolov 1960, 
1963), while neuroscientists were able to 
demonstrate the occurrence of a permanent 
or semipermanent modifiability of neural tis- 
sue (see Pribram and Broadbent, 1970; 
Horn and Hinde, 1970). 


Neural Modifiability 


Sharpless and Jasper’s (1956) and Soko- 
lov's (1963) classical studies paved the way 
for an analysis of the requisites for produc- 
ing behavioral and physiological arousal re- 
sponses. Especially significant was the im- 
plication of information processing systems 
by Sokolov's findings (1963) that any 
change in a repetitively presented stimulus 
configuration would elicit responses even 
when that change was a shortening of the 
sensory input or the lowering of its intensity. 
Sokolov concluded that the organism's sensi- 
tivity was not just a measure of its threshold 
but rather that sensitivity was a function of 
a match between input and some competence, 
a neuronal "model," or a patterned memory 
trace developed in the brain as a representa- 
tion of the experienced stimulus configura- 
tion, Any mismatch produces orienting. 

Many investigators concerned with modifi- 


ability of neural tissue as a consequence of 
experience seized on the habituation of an 
initial neural or behavioral arousal reaction 
as an extremely simple test paradigm that 
suited this purpose. Although these efforts 
and results have been reviewed by Horn and 
Hinde (1970), we will highlight certain is- 
sues that are relevant to this presenetation. 
An approach initiated by Spencer and 
Thompson (see Groves and Thompson, 
1970) involved recording from the spinal 
cord and illustrated that neither sensory af- 
ferents nor motor efferents accounted for 
habituation of spinal reflex behavior. Fur- 
ther, Thompson discerned two primary sys- 
tems, two classes of neurons; those that re- 
sponded more slowly and incremented for 
relatively long periods before decrementing. 
and those that rapidly decremented their ac- 
tivity with repetition of the stimulus. The 
incrementing “monitoring,” “sameness,” or 
“arousal” neurons lie more ventrally and 
medially in the region of the origin of the 
visceroautonomic outflow, while the decre- 
menting neurons lie, for the most part, in the 
dorsal horn of the spinal cord. Convergence 
of these two systems of neurons was as- 
sumed to account for the activity of a third 
population of (“novelty”) neurons whose 
response characteristics were identical to 
those demonstrated by the reflex: arousal 
reaction, habituation, and dishabituation. 
Thompson’s parallel processing model of 
arousal, habituation and dishabituation, 
shared by many other investigators (e8. 
Horn, 1970; Segundo & Bell, 1970 ; Spencer 
& April, 1970) is simpler than Sokolov's as 
it depends only on spatial summation of 
arousal and decrementing. No "neuronal 
model" of the input is developed, only a 
decrementing to repetition and an “arousal” 
proportional to the strength of the input. 
The model does not account for Sokolov’s 
findings that in the intact organism dis- 
habituation occurs when stimulus duration is 
shortened or stimulus intensity is diminished. 
These phenomena imply a hierarchical pro- 
cess involving storage and subsequent match- 
ing of input to store. Horn (Horn & Hinde, 
1970) has developed a scheme whereby such 
a hierarchical process can be realized without 
any unnecessary assumptions that go beyond 
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the data obtained by practically everyone; 
namely that the decrementing observed is 
due to self-generated synaptic depression and 
not to the development of inhibition (inhibi- 
tory postsynaptic potentials characterized in 
intracellular recordings by hyperpolariza- 
tion). 

This does not mean, of course, that inhibi- 
tion fails to occur in many systems. The 
Sensory and motor pathways are replete with 
circuits that lead to both lateral (i.e., neigh- 
borhood) and self-inhibition. As we shall 
see shortly, habituation becomes organized 
into patterns in these pathways whether by 
the mechanisms Proposed by Horn or by the 
action of inhibitory circuits, or both. 


Core Brain Arousal System 


The rostral extension into the mesen- 
cephalic brain stem of a column of medially 
placed cells accounts for the well documented 
arousal effects of stimulations of the reticular 
formation (see Lindsley, 1961, and Magoun, 
1958 for review). Less well known is the 
fact that such effects are obtained even more 
rostrally in the diencephalon in a continua- 
tion of this neuron system into the hypo- 
thalamus. 

The contribution of hypothalamic viscero- 
autonomic mechanisms to the reflexive as- 
pects of the orienting reaction has been 
noted by Feldman and Waller (1962). The 
episodes of fighting and fleeing that are pro- 
duced by electrical or chemical stimulation 
to the so-called “defense” region of the hy- 
pothalamus can be related to the orienting 
reaction. Abrahams and Hilton (1958) and 
Abrahams, Hilton, and Zbrozyna (1964) 
found that, in attempting to produce a de- 
fense response by stimulation of the hypo- 
thalamus, at first, a much lower degree of 
arousal occurred, indicated by pupil dilatation 
and postural alerting. Only when the level of 
stimulation was increased and maintained for 
a few seconds did hissing, snarling, running, 
and piloerection occur. In the later study, 
alerting behaviors were measured in greater 
detail, and during mild _ Stimulation the 
authors observed changes in pupil dilation, 
head movement, pricking the ears, respira- 
tion, and blood flow. These same changes 
were also recorded during responses to sim. 


ple auditory, visual, or cutaneous stimuli in 
the absence of hypothalamic stimulation. 
Since these physiological changes are the 
same as those observed in all orienting re- 
sponses, the defense reaction could therefore 
be considered in part as due to an increase 
of arousal. 


Sensory Systems and the Neuronal Model 


There is ample evidence that a host of 
other brain mechanisms are involved in the 
orienting reaction. Electrical brain record- 
ing, both with macro- and with microelec- 
trodes, shows that initial desynchronization 
and increase in neuronal discharge occurs in 
a large number of neural systems. One of 
the characteristics of familiarization con- 
sequent on repetition of a stimulus is the 
progressive restriction of these sites (John, 
1967 ; John & Killam, 1960). Paradoxically, 
however, this restriction involves primarily 
the sites where arousal neurons are located 
and does not encompass either sensory path- 
ways or the primary sensory projection sys- 
tems where responses continue to be re- 
corded in the habituated organism. These, 
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lov's model was saved, however, by the re- 
sults obtained from recordings of the activ- 
ity of visual cortex. Here, habituation 
produced no uniform decrementing—some 
records invariably and reliably showed an in- 
crementing of activity, while others decre- 
mented (over each of many days of record- 
ing) and still others showed no reliable 
changes. A patterned habituation response 
occurred, and dishabituation to a change in 
stimulation disrupted this pattern. 

Patterned changes in cortical rhythmicity 
also occur. Thus patterning of electrical 
brain activity is not limited to abrupt poten- 
tials evoked by the stimulus. Earlier studies 
have been reviewed by J. Mackworth (1969). 
More recently it was shown that during 
habituation, changes occur in both latency 
(increase) and duration (decrease) of de- 
synchronization when baseline activity is in 
the 10 Hz alpha frequency range (Bagshaw 
& Benzies, 1968; Grandstaff & Pribram, 
1972). More complex frequency analysis of 
the ongoing cortical activity of limited areas 
of the visual cortex (Grandstaff & Pribram, 
1972) has shown that a decrease in power 
with habituation in 8-10, 37-50, and 72-80 
Hz frequency ranges occurs when the ampli- 
tude of the evoked potential decreases. By 
contrast, in those electrode placements where 
the amplitude of the evoked response in- 
creases, an increase in power in the alpha 
and other critical ranges is obtained. These 
patterns of change do not occur at subcortical 
loci (optic nerve and lateral geniculate nu- 
cleus) but are limited to recordings made 
from the visual cortex, supporting the con- 
clusion that patterning of neuroelectric ac- 
tivity, an indication of the construction of a 
“neuronal model,” is primarily the function 
of a cortical information processing com- 
petency. 

The occurrence of a patterned response 
at the cortex suggests that each of the cor- 
tical sites can be considered separate informa- 
tion processing channels, which is consonant 
with the anatomy of these systems. Neisser 
(1967), Kahneman (1973), and Lindsay 
(1970) have thoroughly reviewed the be- 
havioral evidence for considering the sensory 
channels to be primarily parallel processing 
systems, or at least some form of multiplex- 


ing mechanism that relies on an initial simul- 
taneous parallel processing stage, a problem 
which will be considered in the section on 
categorizing. Here we first outline the 
neural systems involved in the arousal and 
activation that leads to desynchronization of 
the patterned neuronal model. 

Amygdala Circuits 

There is evidence for the involvement of 
the amygdala and related frontal cortical 
structures in the attentional control of the 
core brain arousal systems. This evidence 
delineates two reciprocally acting circuits, 
one facilitatory and the other inhibitory, both 
converging on hypothalamic structures re- 
lated to arousal. Such reciprocal innervation 
allows sensitive modulation (tuning) of the 
arousal mechanism. 

In pursuing the question of which brain 
systems are involved in orienting and habitu- 
ation, we first replicated and showed some 
of the limits over which the Sokolov results 
held (Koepke & Pribram, 1966, 1967). 
When adapting the techniques to the monkey 
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Figure 1. Curves for overall percentage of galvanic 
skin response (GSR) to the first 50 presentations of 
anirregularly presented 2-sec tone. (Abbreviations : 
N = unoperated group; H= hippocampectomized 
group; IT = control group; AM = amygdalectom- 
ized group.) 
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FicunE 2. Curves for a more detailed analysis 
of the first 21 trials (split into 7-trial blocks) or 
percentage of galvanic skin response (GSR) to a 
2-sec tone for the amygdalectomized, hippocampec- 


tomized, and unoperated monkeys. (See also 
Table 1.) 


we found, contrary to expectation, that le- 
sions of the mediobasal, frontolimbic fore- 
brain, and not of the convexity of the cere- 
bral mantle, had a dramatic effect on mea- 
sures of orienting (Bagshaw, Kimble, & 
Pribram, 1965; Kimble, Bagshaw, & Pri- 
bram, 1965). 

Amygdalectomy results in a failure of 
behavioral habituation, producing an ever- 
recurring locomotor orienting reaction (Bag- 
shaw & Benzies, 1968; Schwartzbaum, Wil- 
son, & Morrissette, 1961). Such hyper- 
reactivity is also produced by lesions of the 
Orbital surface of the frontal lobe (Mettler 
& McLardy, 1948; Ruch & Shenkin, 1943; 
Wall & Davis, 1951) which has been shown 
to be a part of a larger orbitofrontal-insular- 
temporal-polar-amygdala System (Kaada 
Pribram, & Epstein, 1949). Whenan analy- 
Sis of the visceroautonomic components of 
the orienting response was carried out on 
frontal and amygdalectomized r 


monkeys, it 
Was expected that these components would 


parallel behavioral orienting. In many in- 
stances the opposite was observed. 

Resections of dorsolateral frontal cortex 
always abolish the visceroautonomic com- 
ponents of orienting (Kimble, Bagshaw, & 
Pribram, 1965; Luria & Homskaya, 1970; 
Luria, Pribram, & Homskaya, 1964), while 
amygdala resections produce a bimodal dis- 
tribution, Electrodermal responses during 
orienting (Figures 1 and 2; Bagshaw & 
Benzies, 1968; Bagshaw, Kimble, & Pri- 
bram, 1965) are reduced in amplitude and 
amount for most subjects. However, certain 
subjects show an increased number and am- 
plitude of response (see Figure 3 and Table 
1). Tonic heart rate is paradoxically ele- 
vated (see below), but all phasic heart rate 
responses are abolished (Bagshaw & Benzies, 
1968; Pribram, Reitz, McNeil, & Spevack, 
1974). 

Thus, reciprocal systems appear to be in- 
volved—one apparently relates to the dorso- 
lateral frontal cortex since resections of this 
structure invariably eliminate visceral-auto- 
nomic orienting responses. The other, op- 
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TABLE 1 
AMOUNT AND AMPLITUDE OF GALVANIC SKIN RESPONSES OVER TRIAL BLOCKS FOR SUBJECTS 
Response Amplitude 
(k9) 
Group 
Trials | Trials | Trials Trials | Trials Trials 
1-7 8-14 15-21 1-7 8-14 1521 
| 
Normal (14) 63 43 3.0 | 2.9 3.2 
Hippocampectomy (7) T1* 70* 62 5.9* 5.8* 3.5 
Amygdalectomy (6) | 
Hyperresponsive (2) 90* 50 40 11.4* 4.3* 1.9 
Hyporesponsive (4) 15* 00* 00* 2.0 “ay | elsi 


Note, Number of subjects in each group appear in parentheses following the group. 
* p « .05. 


posite in function, is most likely related to 
the orbitofrontal cortex which has been 
shown to be the rostral pole of an extensive 
inhibitory pathway (Kaada, Pribram, & Ep- 
stein, 1949; Pribram, 1961; Sauerland & 
Clemente, 1973; Skinner & Lindsley, 1973; 
Wall & Davis, 1951). These data relate 
to the early descriptions of the behavior of 
amygdalectomized animals (Pribram & Bag- 
shaw, 1953) which focused on the fact that 
while the majority were tame, unresponsive 
to threat, and nonaggressive, the opposite 
finding was also occasionally observed (e.g., 
Rosvold, Mirsky, & Pribram, 1954), and 
more recent behavioral studies by Ursin and 
Kaada (1960) using restricted lesions and 
electrical stimulations have confirmed the 
suggestion that at least two more or less 
reciprocal systems can be identified in the 


amygdala. 


Attentional Control of the Arousal System 


The presence of reciprocal systems sug- 
gests the existence of a locus upon which 
they converge, a locus which, from anatom- 
ical considerations (Pribram & Kruger, 
1954), is most likely to be found in the 
hypothalamic regions of the brain stem. 
This suggestion is supported by microelec- 
trode evidence at the spinal cord level 
(Groves & Thompson, 1970) which indi- 
cates that the origin of the desynchronizing 
mechanism leading to orienting may be lo- 
calized to the medial parts of the spinal cord 
where the visceroautonomic outflow orig- 
inates, The neurons that respond during 
orienting are likely to have derived their 


input from locations near the visceroauto- 
nomic regulating mechanisms. This is borne 
out both by the results of electrical stimula- 
tions of the hypothalamus (noted above), 
the amygdala (Gastaut, 1953, 1954), and re- 
lated cortical areas (Wilcott, in press). 
Thus, not only can the visceroautonomic 
components of orienting be sharply distin- 
guished from the somatomotor components, 
but their role in the orienting mechanism ap- 
pears to be primary. 

The fact that the two amygdala systems 
are reciprocal, one facilitating, the other 
inhibiting arousal, suggests a controlling, 
modulating mechanism and is in accord with 
evidence on other control functions of the 
amygdala and related structures. Injections 
of carbachol into the appropriate hypo- 
thalamic site will initiate drinking; such in- 
jections into the amygdala have no effect 
unless the animal is already drinking, in 
which case the amount of drinking becomes 
proportional to the amount of carbachol in- 
jected in an exquisitely accurate relationship 
(Russell, Singer, Flanagan, Stone, & Rus- 
sell, 1968). The frontoamgydala influence 
can be conceived as a finely tuned determi- 
nant controlling visceroautonomic arousal 
initiated by the hypothalamic mechanism 
during orienting. It is as if in the absence 
of the frontoamygdala systems, the animal 
would fail to control his drinking behavior : 
Once started he would drink under circum- 
stances in which others would stop. This is 
exactly what happens—and more. Both eat- 
ing and drinking are controlled in this 
fashion—not only their cessation but also 
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Ficure 4. Top: Curves of percentage of galvanic 
skin response generated by three runs of stimuli 
in ascending and descending intensity (in map) by 
the amygdalectomized (AM) and control (N) 
groups. Bottom: A finer breakdown of stimulus 
values from .1 to 1.0 mA, pooled ascending and 
descending values. (From "Effect of Amygdalec- 
tomy on Stimulus Threshold of the Monkey" by M. 
H. Bagshaw and J. D. Pribram, Experimental 
Neurology, 1968, 20, 197-202. Copyright 1968 by 
Academic Press, Reprinted by Permission.) 


their initiation (Fuller, Rosvold, 


& Pribram, 
1957). 


Sensitivity and Registration 


The question arises as to the Significance 
for the orienting reaction of the visceroauto- 
nomic components per se. The absence of 
visceroautonomic reactivity during orienting 
could result simply from lowering the sensi- 
tivity of the organism to stimulation, How- 
ever, resections of frontal cortex (Grueninger 
Kimble, Grueninger, & Levine, 1965) or ot 
the amygdala (Bagshaw and Coppock, 1968. 
McNeil, 1972) do not abolish galvanic skin 
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responses to movements or to shock. When 
threshold for shock was measured, frontal 
lobectomized (Grueninger & Grue 
1973) and nonresponsive amygdalectomized 
subjects were found to have, if anything, 
greater sensitivity (see Figure 4). Specifi- 
cally, these monkeys frequently produced a 
maximal number and amplitude of responses 
to intensities that produced very little or no 
reaction in normal subjects. 
This leaves no doubt that the peripheral 
response mechanism remains intact. Neither 
a change in sensory sensitivity nor an al- 
tered peripheral response mechanism can ac- 
count for the effects of frontal and t 
lesions on the orienting reaction. The crit- 
ical influence. exerted by these forebrain 
Structures appears to relate to arousal in- 
duced registration of input effecti 
in the organization of centr: 
Certain changes relate to the finding that 
despite reactivity to shock, the amygdalec- 
tomized and frontally lesioned subjects have 
fewer "spontaneous galvanic skin responses" 
during the shock sessions, suggesting an al- 
tered base level, So far, we have referred 
to phasic changes in the initial period of 
observation, which often reach asymptote 
within three to five stimulus presentations, 
However, while behavioral and some elec- 
‘rocortical responses appear to be normal 
during orienting, after frontal and amygdala 
lesions (Bagshaw & Benzies, 1968 ; Schwartz- 
baum et al, 1961) the background level of 
these responses is lower than for controls, 
For example, ear flicking is practically ab- 
Sent during h 
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had an effect (deceleratory) opposite to ex- 
pectations. However, response factors and 
incentive factors (reinforcing consequences ) 
had strong accelerating effects. Thus, arousal, 
though it may produce a brief initial phasic 
acceleration (Obrist, Wood, & Perez-Reyes, 
1965), leads more often to tonic heart rate 
deceleration, which is indicative of activation, 
a process detailed in the next section. In 
short, the monkeys with absent arousal reac- 
tions are consistent in showing an elevated 
tonic heart rate. As Elliott's data indicate, 
and as will be detailed in the last section 
of this article, elevated tonic heart rate is 
manifest when the situation demands “ef- 
fort” on the part of the organism. 

The effects of amygdalectomy can be in- 
terpreted as follows: Because the specific 
controls on arousal are removed, behavioral 
arousal fails to lead to the registration of the 
situation by altering the neuronal model but 
results in nonspecific defensive “effort” to 
cope with the situation. This defense reac- 
tion is characterized by an attempt to shut 
off further input (see Pribram, 1969), and 
is reflected in an elevated heart rate and 
other changes indicative of a lack of readi- 
ness to respond meaningfully to the input. 
Thus “effort” is manifest in the absence of 
readiness. This interpretation is borne out 
by the results of an experiment in which in- 
fant kittens were raised in isolation. When 
examined at the age of 6 months their 
visceroautonomic and endocrine reactivity in 
orienting experiments was essentially similar 
to that of amygdalectomized subjects: They 
had not learned to cope with situations and 
thus showed the “defensive” syndrome sug- 
gestive of considerable “effort” (Konrad & 
Bagshaw, 1970). 


Summary 


Studies relating brain function and the 
orienting reaction to sensory input have 
pointed to the „Presence of a system of 
neurons responding to the amount of input 
to them by maintaining or incrementing 
their activity. This core system of neurons 
extends from the spinal cord through the 
brainstem reticular formation, including hy- 
pothalamic sites, and lies in close proximity 
to those responsible for the engenderment of 


visceroautonomic responses, By way of its 
diffuse connections, this system is responsible 
for the more ubiquitous “arousal” responses 
recorded throughout the brain concomitant 
with orienting. Forebrain control over this 
corebrain arousal system is exerted by re- 
ciprocal facilitatory and inhibitory circuits 
centered on the amygdala. These circuits 
control the onset and duration of neural 
arousal much as they control the onset and 
duration of visceroautonomic and appetitive 
responses. 

Our interpretation of the relationship be- 
tween the lack of visceroautonomic responses 
to orienting and the failure to habituate be- 
haviorally has been to suggest that a defi- 
ciency is produced in the more ubiquitous 
central mechanism by which organisms “reg- 
ister” input. When such failure in registra- 
tion occurs, the organism’s nervous system 
is temporarily swamped by the arousing in- 
put and reacts defensively to shut out all 
further input and thus leads to automatisms. 
This interpretation fits the clinical picture 
of the amnestic states (deja vu and jamais 
vu) and the automatisms occurring during 
psychomotor seizures produced by epileptic 
lesions in the region of the amygdala. There 
is also considerable congruity between this 
interpretation and those of Mednick and 
Schulsinger (1968) and of Venables (Gruze- 
lier & Venables, 1972) in their report of two 
classes (galvanic skin responders and non- 
responders) of patients diagnosed as schizo- 
phrenic. However, the interpretation also 
suffers from the difficulties that plague un- 
derstanding of these clinical syndromes: 
How do disturbances of registration in im- 
mediate awareness influence subsequent re- 
trieval? This will be discussed further in 
the following sections. 


VIGILANT ATTENTION, EFFORT, AND THE 
CONTROL OF ACTIVATION 


The interaction between behaving organ- 
isms and their environment is not one-sided. 
The organism is not just a switchboard for 
incoming stimulation, Rather, the essence 
of behaving organisms is that they are spon- 
taneously active, generating changes in the 
environment often by way of highly pro- 
grammed, that is, serially ordered, responses 
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to respond. Yet, under many circumstances 
the two reactions appear to be yoked: at the 
cortex by the TNV and in the hypothalamic 
region by the reciprocity of the "stop" and 
"go" mechanisms. In such situations they 
share the function of reflexly coupling input 
to output, stimulus to response. In the ab- 
sence of controlled arousal and activation, 
behaving organisms would be constantly 
aroused by their movements and moved by 
arousing inputs. There must be some pro- 
cess that involves both arousal and activa- 
tion which allows the uncoupling to take 
place. This process is habituation, which we 
have seen to be critical in the development of 
the neuronal model. Action generated in- 
puts (the outcomes of actions, their reinforc- 
ing consequences) appear to produce more 
complexly structured neuronal models than 
repetitions of simple inputs per se, and re- 
flect the participation of the central motor 
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systems in generating input. It takes longer 
to form a habit in, than to habituate to, the 
same situation. Complex inputs such as re- 
peated exposure to the same musical per- 
formances do not readily induce habituation 
but in such situations a good deal of motor 
readiness—listening—is also involved. 

Having considered the effects of certain 
limbic lesions on arousal, we need now to re- 
view the effects of limbic lesions on habitua- 
tion and to relate such effects to the activa- 
tion of vigilant readiness. 


Hippocampal ( Papes) Circuits 


Subjects with bilateral hippocampectomy 
tend to show a percentage of reactivity and 
amplitude of the galvanic skin response op- 
posite to that observed in the nonresponding 
amygdalectomized monkeys (see Figures 2 
and 3 and Table 1) in the ordinar 
paradigm. 


' orienting 
In addition, the phasic skin re- 
sponse terminates considerably more r 
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manipulating the distractors and thus in- 
creasing the time between the two required 
responses. However, in contrast to the be- 
havior of the controls who simultaneously 
habituated overt manipulation of the distrac- 
tors and the interresponse time, the hippo- 
campectomized group showed decrementing 
only of the overt manipulations—their inter- 
response time failed to habituate at all. In 
this situation, hippocompectomized monkeys 
continue to be perceptually distractible while 
becoming behaviorally habituated and indis- 
tractible (Figure 6). This result is iden- 
tical to that obtained in man with medial 
temporal lesions: Instrumental behavior can 
to some considerable extent be shaped by 
task experience, but verbal reports of the 
subjective aspects of experience fail to indi- 
cate prior acquaintance with the situation 
(Milner, 1958). 

The dissociation between habituation of 
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TABLE 2 


HAL¥F-Recovery RATE or GaLvanic SKIN 
RESPONSES FOR SUBJECTS 


Half-recovery rate for 
controlled amplitudes 


| (kQ/sec) 
Group | 
| Large 
| 2-4k 
Normal (14) | 460 
Amygdalectomy (6) | 
Hyperresponsive (2) | .730* 
Hyporesponsive (4) -460 
Hippocampectomy (7) | .620* 


Note. Number of subjects in each group appear in parentheses 
following the group. 
* p «.05. 


perceptual responses and habituation involv- 
ing somatomotor performance appears to be 
part of a more general effect of hippocampal 
lesions, as it is manifest in other situations 
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FIGURE 6. Distraction with visual pattern stimulus as distractor. (Abbreviations: sham = sham- 
operated subjects; amyg = amygdalectomized monkeys; hip =hippocampectomized animals.) 
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in which hippocampectomized monkeys are 
tested. In a discrimination reversal situa- 
tion, extinction of previously learned be- 
havior and acquisition of newly correct re- 
sponses were observed. However, in con- 
trast to their controls, the monkeys with the 
hippocampal lesions remained at a chance 
level of performance for an inordinately long 
time (Pribam, Douglas, & Pribram, 1969). 
This effect was due to the "capture" of the 
behavior by a position bias to respond to 
the 50% schedule of reinforcement—a bias 
hardly manifest in unoperated monkeys 
(Figure 7 ; Spevack & Pribram, 1973). This 
result suggested that a hierarchy of response 
sets was operative in the situation such that 
"observing" responses (indicative of "atten- 
tion") were relinquished when the probabil- 
ities of reinforcement of discriminative stim- 
uli ranged around the chance level. 

Taken together, these experimental results 
suggest that interference with the hippo- 
campal circuit reduces the organism to a 
state in which the more effort demanding 
relationships between perception and ac- 
tion, between observing and instrumental re- 
sponses, and between stimulus and response 
are relinquished for more primitive relation- 
ships in which either input or output cap- 
tures an aspect of the behavior of the organ- 
ism without the coordinating intervention of 
central control operations. The mechanism 
by which the hippocampal circuit accom- 
plishes this relationship has been elucidated 
to some extent by recordings of electrical ac- 
tivity from the hippocampus both with micro- 
and macroelectrodes and by precise electrical 
stimulations of selected parts of the hippo- 
campal circuit. 


The Hippocampus—Arousal and Activation 


The evidence that delineates the hippo- 
campal mechanism accounts for the general 
observation that the greatest changes in 
gross electrical activity observed during 
habituation are recorded from the core brain 
(especially mesencephalic reticular and limbic 
control systems; John, 1967 ; John & Killam, 
1960). 

The hippocampus with its three-layered 
cortex provides the best opportunity for 
observing unit activity responsible for the 
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changes observed in gross electrical activity 
in repetitive situations. Vinogradova (1970) 
found that all neurons of the hippocampus 
habituate to repetition of a stimulus and dis- 
habituate to any change in the stimulus con- 
figuration. But she distinguished two types 
of neurons, A and I: those activated (30- 
40%) and those inhibited (60%) by a stim- 
ulus, respectively. Habituation occurs by a 
progressively shortening response in the 
course of 16-20 repeated presentations. The 
averaged poststimulus histograms of the two 
classes are "mirror images of each other." 

Some important characteristics of hippo- 
campal units are that (a) the latencies of 
response to a stimulus do not change; (b) 
they are of the order of 100-1000 msec even 
initially; (c) in the ventral hippocampus 
(the only part present in primates, including 
man) stimuli must be of a minimum duration 
of from 4 to 1 sec to produce any noticeable 
change in the background activity and (d) 
such facilitation persists up to 1 min after 
the cessation of the stimulus. These charac- 
teristics indicate a necessity for a long period 
of summation to precede hippocampal facil- 
itation. Vinogradova (1970) interprets her 
findings as follows: 


The duration of reactions in hippocampal neurones 
shows that the processes continue here long after 
the information processing is finished in all specific 
sensory structures, and in primary and secondary 
areas of the cortex as well. ... As Gloor (1961) 
indicated, the quality of sensory information is al- 
most erased in hippocampal neurones. ( pp. 114-115) 


These results are in accord with proposals 
previously put forward by Douglas and 
Pribram (1966). We suggested that the 
hippocampus constitutes part of an error 
(mismatch) evaluating mechanism which 
was conceived to process only the perturba- 
tions resulting from the mismatch among 
inputs (including those consequent on re- 
sponses) (Pribram, 1971). Vinogradova is 
in agreement, therefore, in suggesting that 
precise sensory information is not involved. 
Further, she suggests a mechanism by which 
such processing can be achieved : 


The hippocampus exerts a tonic inhibitory influence 
upon the reticular formation, blocking activatory 
processes through the tonic discharge | of its I- 
neurones when novelty is absent and registration is 
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not needed. But when a stimulus which is not 
registered in the memory system appears, this in- 
hibitory control is blocked (I-neurones become 
silent), arousal occurs, and the process of registra- 
tion starts. (p. 114) 


Lindsley has recently elaborated the mech- 
anism by which the hippocampally controlled 
reticular formation can effect these changes 
in registration. Lindsley (Macadar, Chalupa, 
& Lindsley, 1974) in keeping with many 
other recent publications (e.g., Fibiger et al., 
1973; Ungerstedt, 1974) has dissociated two 
Systems of neurons that influence the hippo- 
campal circuit, One system originates in the 
median raphé and associated structures of 
the mesencephalic reticular formation, the 
other originates more laterally in the locus 
ceruleus and other portions of the periaque- 
ductal grey. We have come to know these 
two systems in the present review as a sero- 
tonergic (indole amine) "stop" mechanism 
associated with arousal, and a dopaminergic 
and  norepinephrinergic catechole amine 
"go" mechanism associated with readiness 
and activation. Lindsley’s findings were ob- 
tained by electrical stimulations of the appro- 
priate structures in the mesencephalic reticu- 
lar formation. Such stimulations of the stop 
mechanisms produced hippocampal desyn- 
chronization and at the same time a syn- 
chronization of the amygdala circuits, Taken 
together with Vinogradova's evidence, this 
suggests a reciprocal process by which the 
controls on arousal are maintained as long 
as hippocampal inhibition of the reticular 
formation is in progress—much as Vino- 
gradova suggests. Only when a mismatch 
from the neuronal model is signaled to the 
reticular formation does this inhibitory con- 
trol become loosened producing hippocampal 

desynchronization—and concomitant relaxa- 
tion, synchronization, of the arousal functions 
of the amygdala circuits. ] Andsley has found 
that often, though not always, such hippo- 
campal desynchronization is accompanied by 
desynchronization of the, sensorimotor pro- 
jection S) stems, suggesting that "registra- 
tion,” an alteration of the neuronal model 
of the cortical. representation, is occurring, 
Note that in this formulation the term “regis- 

refers to a change in the neuronal 
"registration in memory,” a 


tration" 
model, thus a 
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process that, as we shall see, requires effort. 
This use of the term registration must be 
distinguished from “registration in aware- 
ness" which, as noted earlier, is disrupted 
by interierence with the amygdala circuits. 


Theta Rhythins and Intent 


The second mechanism discerned by the 
Lindsley studies is the catechol amine “go” 
system which has been the focus of this sec- 
tion. This mechanism makes possible the 
“What is to be done” reaction, the process- 
ing of response-produced inputs. When elec- 
trically stimulated, the mesencephalic por- 
tions of the “go” mechanism initiate hippo- 
campal rhythmic activity in the theta range 
of frequencies, Early studies (Green & 
Arduini, 1954) had uncovered the paradox 
that the desynchronization of the clectro- 
encephalogram recorded from the brain's 
convexity during “activation” was accom- 
panied by synchronization in the recordings 
obtained from the hippocampus. 
such synchronization is not 
records obtained in monkey and man, com- 
puter analysis has shown it to occur and that 
it can also be studied in the primate ( Crowne, 
mont mt t rium 192). Ting. 

y ‘ neta range (4-8 
Hz) and has become the focus of a long 
series of studies, = 
. That theta frequencies are especially prom- 
“eee from the hippocampus 

D g and Kornmuller (1938). 
Later a series of studies (Green & Arduini, 
1954; Grastyan, 1959; Grastyan, Lissak 
Madarasz, & Donhoffer, 1959) described the 
occurrence of hippocampal theta to 
orienting stage in 
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I hough 
as obvious in 


al t the 
à conditioning situation 


the occurrence i 
j of hippo- 
campal theta was related instead a 


movement (Dalton & Black, 196 


Young, 1972: Black, Y 

Sag DERN, Win Be Binns 4. 
1970) : intended rather e Batenchuk, 
ment because ihe thers Overt move- 


completely cur 
been trained j 
press. 

ticipated 


urs in 
ho have 


: Series of 
(Bland & V SLO 


derwolf, 1971 


experiments 
and his associates 
72a, 1972): Van- 
+ Whishaw, Bland & 


Tue CONTROL OF ATTENTION 


Vanderwolf, 1972). Rats were observed 
while moving freely in the test situation and 
hippocampal electrical activity was recorded 
continuously. Theta activity occurred al- 
most exclusively when the rats were making 
"voluntary" movements, defined as acts or re- 
sponse sequences characterized by flexibility 
and modifiability. 

Adey and his group (Adey, 1970; Adey, 
Dunlop, & Hendrix, 1960; Elazar & Adey, 
1967; Radulovacki & Adey, 1965) took the 
investigation of behavioral effects on hippo- 
campal theta rhythms a step further by 
showing that a shift (+6 Hz) in power 
within the theta frequency range occurs in 
cats from the prestimulus period, through 
stimulus presentation, to the correct response. 
Bremner (1970) has further analyzed the 
changes in theta activity along three dimen- 
sions: (a) an increase or decrease in the 
total amount of power in the theta range; 
(b) a narrowing or broadening of the range 
of energy distribution around the peak fre- 
quency; (c) the location of that peak fre- 
quency in the electroencephalogram spectrum. 
Change in total power of theta depends on 
the visceroautonomic arousal (increase) 
versus somatomotor readiness (decrease) 
distinction delineated here, and the changes 
1m contour and their distribution correspond 
to the categorizing-reasoning distinction to 
be pursued in the latter parts of this review. 

Specifically, Bremner has used the habitu- 
ated organism (rat and man) as a baseline 
preparation and then studied manipulations 
which gave rise to orienting, simple condi- 
tioning, discrimination, and discrimination 
reversal. He found that the power (amount 
of energy) of hippocampal theta increased as 
a function of the stimulus which produces 
orienting and arousal but decreased in the 
anticipatory interval preparatory to response 
in the conditioning situation. Narrowing re- 
flects discrimination, and the location of the 
peak is shifted by discrimination reversal 
procedures, findings especially relevant to 
the results of the behavioral experiments re- 
viewed above (Figure 8). 

Thus Bremner finds that the power of 
theta produced by a distracting stimulus be- 
comes attentuated when an organism is al- 


ready preoccupied in the performance of a 
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task. Further, during discrimination re- 
versal, despite the fact that the now positive 
stimulus elicits enhanced power in the theta 
frequency range within two or three trials, 
extinction oí the prior correct stimulus shows 
a long time course and the dominant fre- 
quency may appear to shift in one direction 
or another (Gray, 1971). These data also 
suggest an explanation for the finding de- 
tailed above (Pribram et al, 1969) that 
hippocampectomized monkeys are little dif- 
ferent from normal subjects in reversal learn- 
ing except that they remain fixed at the 
chance level for an incredibly long period. 
Summary 

By way of review, we note that there 
exists evidence for the organization of a cen- 
tral representation of input, the construction 
of a (cortical) neuronal model. Changes in 
this neuronal model are controlled bv two 
subcortical systems: One is located in core 
portions of the neuraxis and contains neurons 
that increment to or monitor input, while 
another more laterally placed, contains neu- 
rons which rapidly decrement when they are 
repeatedly stimulated. In addition, we have 
delineated three brain mechanisms in the 
rostral portions of these control systems: 
(a) The first centers on the amygdala, which 
regulates the monitoring or "arousal" neu- 
rons and becomes organized into a "stop" or 
reequilibrating mechanism. (b) A second is 
centered on the basal ganglia and involves 
the activation of “go” mechanisms—expect- 
ancies (perceptual) and readiness (motor). 
(c) Finally, a third mechanism comprising 
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Figure 8. Diagram of the relationship among 


dimensions of hippocampal theta activity and 


behavior. 
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the hippocampal circuit has been identified, 
which uncouples the stimulus from the re- 
sponse by coordinating the amygdala and 
basal ganglia mechanisms so that appropriate 
changes in the central representation can oc- 
cur, a process that entails effort. 

As the amygdala is often considered to 
be one of the basal ganglia, albeit related spe- 
cifically to the olfactory and visceral systems, 
brain structures similar in morphology, 
though different in their connections, control 
arousal and activation, "Their coordination 
in effecting change in central representation, 
on the other hand, devolves on a circuit— 
the hippocampus—very different in morphol- 
ogy, and more akin to that characterizing the 
cerebellum, another mechanism notable for 
coordinating complex sequential processes, 

Control by the amygdala is achieved by a 
reciprocal mechanism: a facilitatory lateral- 
frontal-amygdala-lateral-hypothalamic and an 
inhibitory orbito-frontal-amygdala-medial- 
hypothalamic circuit have been identified, 
Both circuits operate on the monitoring (e.g., 
satiety) neurons which in the brainstem ap- 
pear to be serotonergic, that is, indole amine 
neurochemically. We suggest that the con- 
trols modulate the intensity and duration of 
the reaction initiated and thus modulate the 
organism’s arousal, a “What is it?” reaction, 
This suggestion can be readily tested by 
electrically stimulating amygdala (and fron- 
tal) locations (much as was done by the 
Lindsley, 1961, and Clemente et al, 1964 
groups) and recording with microelectrodes 
(as in Richard Thompson’s experiments 
cited in Groves and Thompson, 1970) to de. 
termine the effects on threshold and du 
of the reaction initiated by the moni 
mechanism in the appropriate 
stem (reticular, hypothalamic) neurons, 

Control over the "go" Mechanisms re. 
sponsible for the Preparatory activation of 
brain systems appears to center on the basal 
ganglia. Activation of any particular part of 
the brain is signaled by the gradual develop- 
ment of a negative potential which becomes 
abruptly positive when the Perceptual or 
motor response is executed, There is evi- 
dence that this slowly developing Negativity 
per se is primarily of local origin, The 
mechanism and pathways involved in the 
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controlling function are apparently dopa- 
minergic, part of a more extensive catechol- 
amine system, 

Hippocampal control over the organiza- 
tion of the relationship between arousal and 
activation leads to changes in central repre- 
sentations which may be conceived as 
changes of state, set, or “attitude.” Such 
change, as illustrated in the final section of 
this review, demands “effort.” Hippocampal 
function involves the notorious Papez circuit, 
largely invoked in discussions of brain func- 
tion in emotion, conceived within a visceral 
theory framework (MacLean, 1949). It is 
an interesting historica] note that Papez put 
forward his proposal within the context of 
Nina Bull's attitude theory of emotion 
(1951) and thus comes close to the views 


of the function of this circuit as proposed 
here. 
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of the components of the event-related gross 
potentials evoked in the auditory system by 
Picton and Hillyard (1974). Their suc- 
cinct summary speaks for itself: 


In conclusion we should like to propose a tenta- 
.tive synthesis of all such data into an elementary 
model for the physiological basis of human auditory 
attention. The stability of the early [initial] com- 
ponents of the evoked response would seem to indi- 
cate that auditory information is analyzed in the 
lemniscal or primary auditory system in much the 
same manner regardless of whether it is attended 
or not. A secondary auditory system, imperfectly 
defined but probably comprising reticular forma- 
tion, medial thalamus, and (írontal) association 
cortex, is involved when further evaluation of the 
significance of this auditory information is re- 
quired. Stimulus set directs the preferential input 
to this secondary system írom that part of the 
lemniscal system involved in processing the at- 
tended sensory channel. Stimulus sct is evidenced, 
therefore, by the increased N;-P. [large early com- 
ponents of the evoked potential] response of the 
frontal association cortex. The secondary system 
functions to compare input from the primary audi- 
tory system with memory models or templates of 
expected or significant stimulus alternatives, Once 
a relevant or significant signal has thereby been 
recognized, this decision is followed by appropriate 
perceptomotor sequelae. These sequelae, reflecting 
the contingencies of a response set mode of atten- 
tion are associated with the generation of the wide- 
spread P; [late components of the evoked potential] 
complex recorded from the scalp. (p. 199) 


In concluding this part of the review, we 
want to emphasize for behaviorally oriented 
psychologists the data we believe to be im- 
portant to their interests. 

Three neurally distinct and separate atten- 
tional systems—arousal, activation, and ef- 
fort—operate upon the information process- 
ing mechanism. The presumed operation of 
these control systems is perhaps best illus- 
trated as follows: The orienting reaction in- 
volves arousal but no activation; vigilant 
readiness involves activation but no arousal; 
the defense reaction involves both arousal 
and activation; when neither arousal nor ac- 
tivation is present, behavior is automatic, 
that is, stimulus-response contingencies are 
direct without the intervention of any of the 
control mechanisms of attention. Such auto- 
matisms, when they occur abnormally, are 
diagnostic of lesions affecting the medial sur- 
face of the temporal lobe, the site of two of 
the three control systems, and have also been 
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experimentally produced by extensive lesions 
of the basal ganglia (Mettler & Mettler, 
1942). 

Clearly, in the intact organism these con- 
trol systems continuously interact with each 
other as well as with the information process- 
ing systems where the representational mech- 
anisms become constituted. Figure 9 may 
help to summarize in highly oversimplified 
form the mechanisms detailed thus far, and 
it suggests the relationship to them of the 
various higher level cortical controls which 
are the focus of the discussion in the remain- 
ing parts of this review. 


SELECTIVE AND VOLUNTARY ATTENTION 


Whenever a problem demands a response 
to an invariant combination of input events 
(including those which are response pro- 
duced) the task is usually referred to as a 
discrimination, The invariant combination 
must be detected, categorized, and selectively 
attended to make the appropriate response. 
This selective attending operates on the 
processing of information. 

Another sort of problem solving occurs 
when the combination of input events is vari- 
able but computable. We will refer to these 
tasks as reasoning problems. In nonhuman 
primate experiments, the delayed response 
and delayed alternation are examples of such 
tasks; in human experimentation arithmetic 
calculations and other complex problem-solv- 
ing exercises have been used. 

Reasoning demands the uncoupling of at- 
tention from the immediate input variables— 
the attention involved is voluntary, there- 
fore, in the sense that it is initiated by the 
organism rather than by some input event. 
Such intentional voluntary processes have re- 
cently been studied extensively and are no 
longer the complete mystery they once were 
(McFarland, 1971; Miller et al., 1960; Pri- 
bram, 1962). 

The steadiness of the visual world during 
voluntary eye movement (as noted by Helm- 
holtz, 1924) is not achieved through feed- 
back from eye muscles but through a parallel 
processing feedforward biasing (Pribram, 
1971, chap. 5) or open loop (McFarland, 
1971) mechanism. Detailed experimental 
analyses by Merton of both eye movement 
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» 4 operational definition of selective and 
of voluntary attention, provided by the dis- 
crimination and reasoning tasks described 
above, will be used to illustrate how cate- 
gorizing and reasoning involve the three 
attentional processes discussed earlier, How- 
ever, we must begin with the deline 
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in fact an indicator of a change in capacity— 
“the allocation of spare capacity "—much as 
this is changed in nonliving systems by in- 
creasing the number of channels available. 
lle also goes on to equate "arousal" and 
"capacity" with "effort" and "attention" in a 
global fashion. We hope to have shown al- 
ready that arousal is indicative of but one sort 
of attention, and we will now show that effort 
is involved only when the situation demands 
the regulation of arousal and activation to 
produce a change in information processing 
competency. 

The way in which competency is con- 
trolled by brain systems in the living primate 
is demonstrated by the finding that removal 
of the area of the brain usually called sensory 
or posterior intrinsic or "association" cortex 
reduces the sampling of novel alternatives. 
The opposite effect is obtained when the 
lateral frontal cortex is resected. It was 
noted earlier that removal of this same 
frontal cortex led to an increase in behavioral 
orienting and an abolition of the viscero- 
autonomic components of orienting. There 
thus appear to be opposite effects (posterior 
and frontal) on the number of alternatives 
sampled in a situation. This was interpreted 
to indicate a dual control mechanism deter- 
mining the ability to sample (Pribram, 
1960a). 

Supportive behavioral evidence came from 
an experiment by Butter (1968, 1969) in 
which he investigated the number of features 
usually attended by monkeys while discrim- 
inating between two cues. He did this by 
eliminating each feature in turn in various 
combinations. He found that resection of 
the same brain region (the posterior cortex) 
that produced a restriction in the number of 
alternatives sampled also produced a restric- 
tion in the number of features used to make 
the discrimination, 

Electrophysiological evidence has been ob- 
tained that the posterior and frontal cortex 
contribute opposing controls on sensory 
channels. This evidence is based on changes 
produced in the recovery cycles of the sys- 
tem (the speed with which the System re- 
covers to its full capacity after a sudden, in- 
tense stimulus) and the alterations produced 
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in the shape of visual receptive fields (Spi- 
nelli & Pribram, 1966, 1967 ). 

These changes in sensory channels were, 
however, not attributed to a simple change 
in the number of channels of fixed capacity, 
as the effects of surgical resection have 
shown that as little as a few percent of an 
anatomically defined sensory channel is suffi- 
cient for ordinary discrimination learning, 
performance, and transfer (Chow, 1970; 
Galambos, Norton, & Frommer, 1967 ; Lash- 
ley, 1929). The remainder of any input 
channel appears to be redundant, spare chan- 
nel capacity, under most circumstances, We 
therefore interpreted (Pribram, 1967) the 
results on the control of input channels by 
posterior and frontal cortex as influencing 
redundancy, not sensory capacity in the usual 
information theoretic sense. Specifically, we 
suggested that the input systems acted as 
channels in which spatial and temporal mul- 
tiplexing could occur, a suggestion similar to 
that put forward by Lindsay (1970). 

We would like to suggest, therefore, on 
the basis of the data reviewed above, that 
Kahneman's (1973) concept that arousal in- 
volves an increase in the number of sensory 
channels available be generalized to include 
constraints involving the redundancy char- 
acteristics, the competency, of that capacity. 
Kahneman's discussion approaches such a 
generalization when he speaks of changes in 
"structural connections between components." 
In technical language, such changes in com- 
petency would be reflected in changes in the 
equivocation of the channel (defined as the 
sum of noise and redundancy). Competency 
is the reciprocal of equivocation. Effort can 
then be defined as the measure of the atten- 
tion "paid" to increase or maintain efficiency 
by reducing equivocation, that is, enhancing 
competency. 


Internal and. External Constraints 


Garner (1962), in his analysis of the 
structure of redundancy, has shown that the 
total amount of constraint operating in any 
System of variables can be divided into in- 
ternal and external components. Internal 
constraints refer to the relationships among 
the systems of variables under consideration, 
while external constraints refer to the rela- 
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tionship between these variables and some 
external referent system of variables. In our 
neurophysiological experiments we consider 
the constraints that describe the central op- 
eration of the channel as internal and the 
constraints that refer to operations on the 
environmental situation which control its 
sensory input as external. In addition, we 
have found it important to distinguish be- 
tween temporal (repetition of the use of the 
channel or variable over time) and spatial 
(replication of the variable over space) re- 
dundancy for each of Garner's categories. 
Specifically, we have suggested (Pribram, 
1967) that when the frontal system becomes 
involved in the orienting reaction, the in- 
ternal redundancy in the input channels is in- 
creased so that all of the information being 
simultaneously processed becomes chunked 
into one unit. By contrast, when the poste- 
rior cortex becomes involved in the atten- 
tional process, internal redundancy in the 
input channels is decreased, separating the 
bits of information in each channel from each 
other. This is concomitant with enhance- 
ment of external redundancy which, accord- 
ing to Garner's findings, enhances the ability 
to make discriminations, that is, to categorize 
input. 
In short, we suggest that the controls on 
arousal and activation operate on the mech- 
anisms of redundancy, on the constraints 
operating within and between channels rather 
than on the information being processed, 
These constraints involve the neuronal mode] 
and may be conceived of as Operating on 
memory rather than on input information, 
Another way of stating this is to say that the 
controls operate on the context in which the 
informational content is processed. 
A good deal of additional evidence can be 
cited to show that competency rather than 
sensory channel capacity per se is controlled 
by the attentional Systems discussed here, 
For instance, the studies of Anderson and 
Fitts (1958) cited by Garner (1962) show 
that as much as 17 bits of sensory informa. 
tion can be simultaneously processed. The 
work of Lindsay (1970) which demonstrates 
the relationship between sensory discrimin- 
ability (difficulty in distinguishing between 
inputs) and central processing competency 
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has already been mentioned. Pribram, Lim, 
Poppen, and Bagshaw (1966) and Mishkin 
and Pribram (1955), using various forms of 
the delayed alternation tasks, attributed the 
differential effects obtained after resections 
of the two reciprocal frontoamygdala systems 
as due to selective alterations in the struc- 
ture of internal redundancy (spatial and tem- 
poral, respectively) of the remaining process- 
ing competency, Further, Pribram and Tubbs 
(1967) have shown that when the delayed 
alternation task, the nemesis of monkeys with 
frontal lobe resections, is externally parsed 
or chunked by making the intertrial intervals 
asymmetric, the deficit is completely over- 
come. Similarly, M. Wilson (1968) ana- 
lyzed the trade-off between tasks involving 
external temporal and spatial redundancy in 
reciprocal mechanisms (anterior and poste- 
rior infero-temporal cortex) which have been 
delineated within the posterior system, 

Thus, both Kahneman's (1973, pp. 8, 9, 
15) analysis and ours attribute the control 
of arousal and selective attention to altera- 
tions in information processing channels, not 
the direct control on information and uncer- 
tainty per se. We differ in that Kahneman 
focuses on th increasing the 
capacity—the 
pacity"— while we em- 
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autonomic nervous system becomes more 
specific due to activation (basal ganglia) of 
the central representation (sensorimotor cor- 
tex) of the somatomotor mechanism, where 
phasic and tonic mechanisms become sep- 
arately involved. Further, the sympathetic 
system more often mirrors stimulus param- 
eters such as novelty and complexity, whereas 


the parasympathetic division which controls . 


heart rate reflects the parameters of the 
somatomotor readiness and response mech- 
anism—for example, the incentives, response 
biases developed on the basis of reinforcing 
consequences of behavior. This distinction 
has been clearly demonstrated in a study by 
Elliott (1969; Elliott et al., 1970) as we 
have reviewed, by Dahl and Spence (1971), 
and by Hare (1972). 

Experimenters, using autonomic measures 
during discrimination tasks in which input 
must be categorized in order to make an 
appropriate response, appear to use the term 
“arousal” as actually defined by the amount 
of categorizing required, and studies consist- 

“ently have shown that sympathetically in- 
nervated phasic responses distinguish be- 
tween variations in task largely on the basis 
of their collative, informational variables: 
their novelty, surprisingness, or complexity 
(Blaylock, 1972; Hare, 1972; Kilpatrick, 
1972; Lacey & Lacey, 1970; McGuinness, 

1973). A typical experiment showing this 
effect was that of Hare (1972) in which sub- 
jects viewing slides without making dis- 
criminations between them were less aroused 

(lower skin conductance) than those who 
had to categorize, and this was only an initial 
effect; phasic skin conductance responses 
rapidly returned to baseline (habituated) 
over trials. Difficulty due to changes re- 
quired in response biases were reflected in 
tonic, skin. conductance but most precisely 

tion in Me irt rate changes. 

The absence of a precise relationship be- 
tween galvanic skin response and response 
variables has not yet been demonstrated 
which may be because the galvanic skin re- 
sponse has no known metabolic function 
(Venables & Christie, 1973). Current data 
show that the involvement of somatomotor 
responses enabled by readiness, as well as 
tonic cardiovascular changes, provide the 
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clues to unraveling the controversial and ap- 
parently disparate views of the psychophysio- 
logical mechanism involved in attention. 


Readiness and Execution 


Two prevailing theories concerning the 
meaning of cardiovascular processes during 
attention appear diametrically opposed. On 
one hand there is the baroceptor feedback 
theory of Lacey (Lacey, 1967; Lacey & 
Lacey, 1970) which, simplified, states that 
heart rate will directly affect stimulus intake 
or rejection through a process of feedback to 
the bulbar inhibitory centers in the brain- 
stem, reducing arousal of the central nervous 
system (Bonvallet & Allen, 1963). On the 
other, Obrist and his colleagues (Obrist, 
Webb, Sutterer, & Howard, 1970a, 1970b) 
propose that heart rate is regulated by the 
motor demands of the organism and that 
these demands are controlled centrally. 
Findings by Jennings, Averill, Opton, and 
Lazarus (1971) have challenged the ade- 
quacy of either of these explanations and 
have produced results in line with an 
“amount of attention” hypothesis. They 
argue that as demands on attention increase, 
heart rate will fall in proportion to the com- 
plexity of the categorization demanded by 
the task, unless metabolic activity is engaged. 
This is somewhat in line with the view of 
this article, but their suggestions are still in- 
complete. As they note, this hypothesis does 
not explain the functional significance of 
cardiac deceleration during attention. Nor 
too, we might add, does it explain why heart 
rate always increases during reasoning. 

To resolve these conceptual difficulties, the 
physiological processes will be examined in 
more detail. The assumption of the majority 
of theories is that there is a unitary relation- 
ship between cardiovascular and somatic 
processes whereas evidence shows that the 
situation is far more complex. 

Any registered orienting response pro- 
duces a dual sympathetic-parasympathetic 
effect. The cardiovascular reflex in orient- 
ing results in sympathetically controlled and 
cholinergically mediated gross blood flow 
shifts which are sometimes accompanied by 
an initial phasic heart rate acceleration. This 
will vary in magnitude with stimulus in- 
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tensity (Graham & Clifton, 1966) and an 
initial respiratory block will often combine 
to elevate the acceleratory effect (Jennings 
et al, 1971; Petro, Holland, & Bouman, 
1970; Wood & Obrist, 1964), although the 
effect of intensity is not entirely predictable 
(Bull & Lang, 1972). We now know that 
this is due to parasympathetic inhibition and 
that all attentional (as opposed to movement- 
produced) heart rate changes are under 
vagal regulation (Eckberg, Fletcher, & 
Braunwald, 1972; Forsyth, 1970; Obrist, 
Howard, Lawler, Sutterer, Smithson, & 
Martin, 1972). 

The initial phase is followed immediately 
by a heart rate change in the direction of 
deceleration, which is due to the vagus re- 
Stabilizing the system, If the prepared so- 
matic systems become mobilized to initiate 
an appropriate consummator 
tern—eating, drinkin 


During a vigilance or categorization task 
the time course of the Tesponses is somewhat 
different (Lacey & Lacey, 1970). Now the 
vagal restabilization phase is extended as 
part of a tonic activation—the vigilant readi- 
ness process. Lewis and Wilson (1970), 
for instance, examined cardiac responses to 
a picture-matching task in young children, 
In all children, a marked deceleration oc. 
curred which lasted until the choice was 
made. The most interesting finding was 
that the greatest deceleration was related to 
correct responses—if a subject was asked to 
“guess again,” heart rate did not return to 
baseline until the final choice. Correct re- 
sponses occurred with longer response times 
and greater cardiac deceleration, which sup- 
ports the suggestion that the vigilant readi- 


ness phase of the control Processes has been 
extended. 


Correlation to IO demonstrated that there 


was a relationship to both deceleration and 
error score for girls only. The sex differ- 
ence finding could possibly be due to the 
greater amount of movement found in boys 
which would cause a heart rate acceleration 
and less attention to the task. This explana- 
tion is made more tenable by the recent find- 
ing of Obrist, Howard, Hennis, Murell (in 
press) that children’s reaction times are 
slower in proportion to the amount of uncon- 
trolled movement they produce. The rela- 
tionship between motor control and reaction 
time was linear with age. As reaction times 
increased, motor responses and heart rate de- 
creased. We therefore suggest, in line with 
Obrist et al. (1970a, 1970b), that when ac- 
tivation must be maintained, heart rate de- 
celeration occurs as an accompaniment of 
somatomotor readiness. 

This then partially accounts for the se- 
quence of physiological events in a certain 
type of problem solving. When the organism 
prolongs attention to the complexity of input 
in a categorizing discrimination situation, 
he reduces extraneous noise by eliminating 
random movements, Heart rate then adjusts 
to the reduced demands of the system re- 
flecting characteristics of the somatomotor 
system in which overt movements regularly 
precede blood flow changes and heart rate 
acceleration. Petro et al. (1970) found that 
after a voluntary contraction of the biceps 
muscle, heart rate increased with a latency 
of approximately 500 msec, However, when 
Teasoning is initiated with its emphasis on 
Which response to produce when, and the 
actual trial-and-error making of responses; 
the cardiovascular and Somatomotor response 
relationship is the reverse. 
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(categorizing) or increase (reasoning) in 
isotonic contraction, but, in addition, activa- 
tion of the readiness mechanism may also 
change a special sort of metabolism, usually 
thought to be largely anaerobic, which has 
been shown to be involved during concentra- 
tion and problem solving and involves iso- 
metric contraction. 

An understanding of this special metabolic 
process during attention has been provided by 
Berdina, Kolenko, Kotz, Kuzetzov, Rodinov, 
Savtchencko, & Thorevsky (1972). Until 
this study there was no functional explana- 
tion for the finding that sympathetically in- 
nervated blood flow shifts actually reduce 
oxygen uptake by the muscle while somatic 
activation increases it (Rosell & Uvnis, 
1962). The study of Berdina et al. suggests 
the possibility that this special sort of metab- 
olism of muscle fibers is due to cholinergic 
mediators. Berdina et al. were interested in 
the effect of problem solving on blood flow 
and muscle contraction. Until their study, 
it had been extremely difficult to establish 
any precise relationship between electromyo- 
gram recording and problem solving due to 
the small amount of amplification produced 
by the technique, which often results in un- 
reliable values (Jennings et al., 1971; Lader, 
1965; Matthews & Lader, 1971). To avoid 
this difficulty they introduced a condition of 
partial muscular contraction: Subjects had 
to grip a hand dynamometer which they were 
pretrained to maintain at 30% of their max- 
imum grip. Then, during the problem-solv- 
ing experiment, subjects were asked to grip 
the dynanometer for as long as possible. 

Both contraction alone and problem solv- 
ing (arithmetic) alone caused significant 
blood flow changes. However, a combina- 
tion of contraction plus arithmetic produced 
no greater changes in blood flow than either 
condition alone. On the other hand, the con- 
traction duration was significantly increased 
by arithmetic from 181 to 235 seconds (p < 
.001). To disentangle voluntary effects from 
the purely reflex effects, they induced a fore- 
arm contraction by electrical stimulation. 
The results were the same even when the 
subject had no voluntary control over his 
muscles. An injection of atropine signifi- 
cantly decreased blood flow during arith- 
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metic and during contraction plus arithmetic 
but not during contraction alone. Atropine 
also reduced contraction duration during the 
combined task showing that the increased 
contraction during problem solving was not 
a somatic effect. Since small amounts of 
circulating atropine do not affect the neuro- 
muscular junction, it was concluded that 
some form of sympathetically controlled 
metabolism was causing muscle force to in- 
crease, making muscles “work” under the 
special conditions of isometric contraction. 
Apart from other considerations, this experi- 
ment demonstrates conclusively that muscu- 
lar effects do occur during problem-solving 
tasks and that they involve gross changes in 
tonicity which cannot always be observed by 
electromyogram recording. Equally relevant 
is the fact that as heart rate correlates so 
precisely with blood flow changes (Matthews 
& Lader, 1971), an increase in blood flow 
during problem solving automatically implies 
a corresponding increase in heart rate. The 
Berdina experiment thus explains the func- 
tional significance of the heart rate changes 
during categorizing and reasoning: Main- 
taining attention while problem solving is 
effortful and involves “hard work” and hard 
work is accompanied by metabolic changes. 
Tt does appear from this study that the out- 
lines of the metabolic processes operating 
during intense concentration have been estab- 
lished. 

In summary, the discovery that two types 
of muscle activation (isotonic and isometric) 
and two metabolic functions (aerobic and 
anaerobic) are involved in both voluntary 
attention and action makes it easier to under- 
stand why some confusion has arisen in the 
study of the psychophysiology of attention. 
Returning briefly to the Lacey hypothesis, 
physiological responding during stimulus in- 
take will depend entirely upon how a task is 
constructed and whether it is primarily a 
categorizing or reasoning problem. We 
agree with Lacey that during a state of be- 
havior characterized by heart rate decelera- 
tion, we are most open to environmental 
cues (Lacey & Lacey, 1970) but not neces- 
sarily for the reasons he suggests (see also 
critique by Hahn, 1973). Lacey's theory 
also requires acceleration when decisions are 
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made. How then could one interpret what 
is occurring in acquiring a caleporizing, ‘or 
motor skill? When the coordination of 
arousal and activation is demanded, heart 
rate acceleration reflects the amount of effort 
involved in registering, adjusting, and chang- 
ing the central representation to the require- 
ments of the task. In accord with Lacey’s 
assumptions, intake of information must al- 
ternate with concentration, A process en- 
sues that must constantly shift between rele- 
vant stimuli in order to overcome the rapid 
decrementing of the system (just as con- 
tinuous eye movements overcome the rapid 
adaptation of retinal receptors), 
plays tennis, for example, 
vated, but the player must flexibly attend 
(be aroused by) such cues as the angle and 
velocity of the approaching ball, the ground 
lines, the height of the net, and the position 
and angle of the opponent’s body and his 
racket. As well, the intricacy of the series 
of highly refined changes in response and 
the transformational calculations based on 
this evidence could also elevate heart rate, 
even without the behavioral activity of play- 
ing tennis occurring (eg. in a Spectator). 
This is brought out clearly in a study by 
Johansson and Frankenhaeuser (1973) in 
which high heart rate accompanied the com- 
plex transformations required during an in- 
take task. 

By contrast, in reasonin 
precedes arousal. For instance, in playing a 
game of chess or engaging in any similar 
pursuit requiring a good deal of skill, trans- 
formational calculations must occur before 
the effective response is achieved. Once the 
"move" has been made, it may arouse an 
"aha" reaction: The move Was successful, or 
"Oh dear, why didn't I see that!" Again 
the physiological changes that occur are those 
that have been shown in the Berdina experi- 
ments, although the theories of Jennings 
et al. (1971) and Obrist et al. (1970) would 
predict the opposite, 

Some of the task varia 
arousal and activation and 
to overcome resistance t 


prior set during categoriz 
have also been 
Spence (1971), 


When one 
heart rate is ele- 


g tasks, activation 


bles which affect 
the effort required 
o extinction of a 
ing and reasoning 
investigated by Dahl and 
They propose an activation 
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theory which adopts Bergum's (1966) tech- 
nique of taxonomic analysis of performance, 
in which tasks are evaluated according to 
complexity and activation of response. Task 
demands were rated by independent judges, 
and the amount of heart rate change was cor- 
related to each task. Heart rate increased 
with response demand, and all correlations 
were significant. A subsequent factor analy- 
sis revealed a major factor which they desig- 
nated “density,” or degree of concentration 
required ; in other words, the total amount of 
cognitive effort involved in the task. The 
other factor which accounted for the greatest 
amount of variance in task performance was 
information content of relevant stimuli. 
Thus, again, collative variables are distin- 
guished from the amount of transformational 


effort or “work” required in problem solu- 
tion. 


Cognitive Effort 

Do these results mean that cognitive effort, 
the effort of thinking and problem solving, 
is due entirely to the activation of periph- 
eral muscular mechanisms as reflected. in 
changes of heart rate? We cannot answer 
this question conclusively at this time, and in 
fact, there may be no conclusive answer, at 
least not where normal intact problem-solv- 
ing organisms are concerned. The neuro- 
muscular apparatus Works as a unit, brain 
and body inseparably yoked. Yet Malmo 
(Malmo & Belanger, 1967) and Lacey 
(B. C. Lacey & J. I. Lacey, 1974) have made 
an excellent case in their extensive, detailed, 
and perceptive reviews that there is good 
evidence to believe that the effort involve 
mM Cognitive work js not wholly due t° 


: . n. 
changes Observed in the muscular syste 
Further evidence com 


.urth es from pathologica 
Situations such as complete cervical sp!na 
transections which arise when partial separa" 
tion between brain and body occur. 

have already reviewed the fact that theta 
activity in the hippocampal formation, ust- 
ally present when the animal moves about 


voluntarily, is also recorded under the same 
circumstances in curarized and therefore 
completely paralyzed subjects, A broke? 
neck that completely severs the muscular sy 
tem from control by the brain does not pre 
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clude effortful problem solving. The com- 
petency to think remains. 

We know from observations of sensory 
disturbances that processes which are initially 
localized peripherally become progressively 
neuralized. Thus phantom limb pain follow- 
ing an amputation can be successfully treated 
immediately by injections of an anaesthetic 
at the site of amputation. After a few 
months, however, such peripheral injections 
usually fail to have an effect, and spinal 
cordotomy (section of the spinothalamic 
tracts) must be done to relieve the disturb- 
ance. Should several years of intractible 
pain be experienced before treatment, even 
spinal cordotomy often proves useless. The 
brain itself must be operated upon: Frontal 
leukotomy or parietal lobectomy are now re- 
sorted to. 

It is thus plausible that the neuromuscular 
processes demanding effort are to a large 
extent initially peripheral in their origin and 
manifestation. But as problem solving pro- 
gresses and problem-solving skill develops, 
the effort becomes more and more a con- 
comitant of the brain processes involved— 
apart from, or only reflected in, the periph- 
eral manifestations—and thus becomes truly 
cognitive. How then are we to study and 
understand the brain processes that demand 
effort? 

Studies of local differences in cerebral 
blood flow and metabolism provide the ob- 
vious answer (for review see Ingvar and 
Lassen, in press). However, it is also help- 
ful to define the issues involved, especially 
since overall cerebral metabolism is aug- 
mented so little by even the most effortful 
problem-solving activities. Just as quantifi- 
able definitions of information and uncer- 
tainty become useful in understanding in- 
formation processing, so quantitative defini- 
tions of effort, work, and energy become 
useful in understanding control mechanisms. 
We have already noted that effort can be 
defined as the measure of attention “paid” to 
increase or maintain the efficiency of a com- 
munication channel by reducing its equivoca- 
tion. As detailed, such reduction can be ac- 
complished by reducing noise or changing 
the structure of the redundancy of the com. 


munication—chunking or parsing being 
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prime examples of such a process. McFar- 
land (1971) has approached the problem 
from a control rather than an information 
theoretic stance and comes to the identical 
conclusion. In keeping with engineering 
tradition, he defines work (pp. 215-220) as 
the measure of the rate of changing the state 
of a system and energy as the capacity for 
doing work. Thus the changes in neuronal 
representational model, changes in informa- 
tion processing competency, in set, attitude 
or state, developed by the hippocampal cir- 
cuit described in this review, demand work— 
by definition. Further, resistance to change 
may occur when changes take place in re- 
lated systems and the changes are not inde- 
pendent of one another: For example, stim- 
ulus and response may originally be coupled 
as in a defense reaction—that is, internally 
constrained by a preexisting neuronal model. 
In such a situation, a change in state is pro- 
duced only when the input and output sys- 
tems become uncoupled by the hippocampal 
circuit. In a distraction situation, mainte- 
nance of state involves loosening an external 
constraint as we have seen, and the un- 
coupling of arousal and activation, thus in- 
creasing internal constraint. As an analogy, 
change in blood sugar level may be depend- 
ent upon, or constrained by, change in os- 
motic pressure and basal temperature and 
motor activity so that the behavior of an 
organism must be analyzed in relation to 
Such constraints (see for example, McFar- 
land, 1971). 

The constraints operating in a system can 
often be expressed in terms of integrable dif- 
ferential or algebraic equations, in which case 
they are known as “holonomic constraints." 
Holonomically constrained systems allow 
description in terms of “the number of gen- 
eralized coordinates required to describe the 
state of the system and this number is known 
as the ‘degrees of freedom’ of the system” 
(McFarland, 1971, p. 217). In systems 
showing holonomic constraints, it should 
therefore be possible, in principle, to measure 
resistance and therefore the effort necessary 
to overcome the resistance. For instance, in 
discrimination reversal and in delayed alter- 
nation, the rate of extinction of the hippo- 
campal theta peak to the previously rewarded 
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Ficure 10. Comparison of response-locked ac- 
tivity evoked in temporal cortex when monkeys 
are performing (90% correct) color (top panels) 


and pattern (bottom panels) discrimination. The 
average evoked response (300 trials) was produced 
by the stimulus configuration shown in the diagrams 
between the panels. Each tracing =250 msec prior 
to and 250 msec just after each response. Note that 
during the color discriminations the Ist and 4th 
(and the 2nd and 3rd) traces are similar, while 
during pattern discriminations the Ist and 3rd 
(and 2nd and 4th) traces are alike. These similar- 
ities reflect the position of the color cues in the 
color task and the position of the patterns in the 
pattern task. Position per se, however, is not en- 
coded in these traces. Note that this difference oc- 
curs despite the fact that the retinal image formed 
by the flash stimulus is identical in the pattern and 
color problems. (Abbreviations: R* = Red rein- 
forced; Gt= Green reinforced; C+ = Circle rein- 
forced; S* — Square reinforced.) 


stimulus might be used quantitatively in mea- 
suring the resistance involved in Switching 
attention from one cue to another. In a 
selective attention experiment, where first 
one then another dimension of a multidimen- 
sional cue is reinforced (see Figure 10; 
Pribram, Day, & Johnston, in press; Roth- 
blatt & Pribram, 1972), recording of hip- 
pocampal theta and mapping cortical CNV 
or desynchronization during the changeover 
from one dimension to the other might be 
similarly useful. Neurologically, this mea- 
sure of resistance would reflect the rate of 
release from entrainment of rhythmi 


C pro- 
cesses by 


a temporary dominant focus or 
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pacemaker. The problem is at the forefront 
of our capacity to conceptualize it; however, 
the direction in which conceptualization must 
proceed has been given: We must ascertain 
whether the constraints operating in a par- 
ticular behavioral system are holonomically 
related and determine the number of coor- 
dinates that are required to describe it (the 
degrees of freedom involved). We can then 
perform rigorous quantitative experimental 
analyses of changes of state (transforma- 
tions) of each of the coordinate subsystems 
and thus finally arrive at the resistance of 
the system as a whole and the effort involved 
in overcoming this resistance. E. Roy John 
(Bartlett & John, 1973 ; John, Bartlett, Shimo- 
kochi, & Kleinman, 1973), Adey (1967), 
Callaway and Harris (1974), and our lab- 
oratory (Pribram, Day & Glick, in press) 
have begun this type of analysis and initial 
results, at least, are promising. 

This review has dealt with initial specifica- 
tion of the locus of resistance in the attention 
mechanism. We have outlined in a very pre- 
liminary way that at least two major subsys- 
tems, a visceroautonomic “arousal” and a 
somatomotor “activation” mechanism, must 
be considered. We have also reviewed the 
evidence for a control mechanism operating 
on each system (the amygdala circuits for 
arousal; basal ganglia for activation) and a 
mechanism (the hippocampal circuit) that 
operates to overcome the constraints which 
would ordinarily maintain a tight join be- 
tween stimulus and response and thus keep 
these two control systems from functioning. 
We have suggested that overcoming these 
constraints may demand effort, Whether the 
constraints in attention operate holonomica y 
or nonholonomically remains to be deter 
mined; but the techniques for determinatio? 
are available. Should it turn out that m 
straints can be expressed or even appro, 
imated in terms of integrable different 
equations, a measure on cognitive effo! 
would be attainable. 


jus 
Whether the attention comes by grace of gen 


a 
or by dint of will, the longer one does attend "he 
topic the more mastery of it one has. e 
faculty of voluntarily bringing back a W 
attention, over and over again, is the very 
judgment, character, and will (James, 1950, P- 
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THEORETICAL NOTES 


Generation-Recognition Theory and the Encoding Specificity Principle 


, Edwin Martin 
University of Michigan 


Tulving and Thomson recently contradicted the generation-recognition theory 


of recall with the phrase “recognition failure of recallable words.” 


This 


phrase, however, is not an admissable summary of their experiments. The 
word LIGHT in the cue-target pair head LIGHT is not the same as the word 


LIGHT in the free-association pair dark LIGHT. 


Subjects who recall LIGHT 


to head but refuse to identify the LIGHT that goes with dark as the same as 
the LIGHT that goes with head are not failing to recognize a recallable word 
because the word in question is in fact two different words. Tulving and 
'Thomson's argument against generation-recognition theory involves a prob- 


lematic shift in level of discourse. 


According to generation-recognition theory, 
what recall consists of comes in two parts: (a) 
The subject scans his memory and comes up 
with a set of response candidates ; and (b) the 
response candidates in this set are tested against 
a recognition criterion of some sort to the end 
that one, some, or all of them are accepted and 
hence overtly produced. The general class of 
generation-recognition models is thus concerned 
with specifying the structures and processes 
that result in retrieval of a set of plausible re- 
sponses and with specifying the nature of the 
screening process that partitions this set into 
acceptable and unacceptable subsets. 

An important feature of this theory is that 
generation and recognition are ordered pro- 
cesses. The decision criteria that together con- 
stitute the recognition test are applied after 
the retrieval process has supplied a list of re- 
sponse candidates. The importance of this fea- 
ture lies in the fact that all one needs to do to 
produce a thoroughly damaging counterexample 
is to demonstrate recognition failure of recall- 
able words. Indeed, the staged ordering of 
generation and recognition seems intuitively to 
be the feature of this theory most directly ap- 
proachable by experimentation and hence the 
feature most vulnerable to contradiction. Given 
this much, plus the fact that a demonstrated 
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counterexample is always an exciting event in 
any science, we are seemingly left with the con- 
clusion that Tulving and Thomson (1973) have 
presented us with an exciting event: They 
have reported data from three experiments, and 
on the basis of these they have concluded that 
nonrecognizable words are recallable and there- 
fore that generation-recognition theory is 
denied. 

The primary purpose of this article is to 
argue that Tulving and Thomson have not 
demonstrated recognition failure of recallable 
words. A logically related secondary purpose 
is to point out that Tulving and Thomson have 
blurred two levels of analysis. What I mean 
by this is that their arguments and conclusions 
involve undetected or unacknowledged switch- 
ing between statements about data and state- 
ments about theoretical entities, a result of 
which is a faulty attack on generation-recogni- 
tion theory. 


Tue EVIDENCE 


The evidence presented by Tulving and 
"Thomson may be schematized as follows: First; 
the word pair head LIGHT was presented to the 
subject. The subject knew he was later to be 
tested for his ability to produce rromT in TO 
sponse to the cue head and subsequently did so 
with about 60% success. After presentation 9 
the pair head LIGHT and before the test for 
recall of LIGHT to head, the same subject was 
asked to free associate to the stimulus 
dark, The word LIGHT was among the 
ciates 60-80% of the time, When then 
to recognize the word LIGHT (which 
just generated as an associate of dar” 
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TABLE 1 
MATERIALS USED IN THE TULVING-THOMSON EXPERIMENTS 


List A List B 

Recall cue Free-association cue Target word Recall cue Free-association cue Target word 
ground hot COLD* hope low HIGH* 
head dark LIGHT* stem long SHORT* 
bath want NEED» whiskey lake WATER 
cheese grass GREEN moth eat FOOD 
stomach small LARGE" cabbage square ROUND* 
sun night DAY glass soft HARD* 
pretty sky BLUE country closed OPEN"! 
cave dry WET= tool finger HAND 
whistle tennis BALL memory fast SLOW* 
noise blow WIND covering lining COAT 
glue table CHAIR barn clean DIRTY* 
command woman MAN spider eagle BIRD 
fruit bloom FLOWER crust bake CAKE 
home bitter SWEET* deep bed SLEEP 
grasp infant BABY’? train white BLACK”! 
butter rough SMOOTH? mountain leaf TREE 
drink tobacco SMOKE cottage hate LOVE 
beat ache PAIN? art boy GIRL 
cloth lamb SHEEP» adult labor WORK? 
swift stop GO brave strong WEAK 
lady king QUEEN door color RED 
blade scissors CUT roll carpet RUG^ 
plant insect BUG» think dumb STUPID’? 
wish soap WASH exist human BEING 


Note, opio from ''Encoding Specificity 
Psychological Review, 1973, 80, 352-373. C 

å Combination of target word and free- 
is a specific denotative noun phrase. 


nd Retrieval Processes in E 
‘ight 1973 by the American Psy 
ssociation cue names an abstract dimens 


;" by E. Tulving and D. M. Thomson, 
‘chological Asso: on. Reprinted by permission. 
: combination of target word and recall cue 


b Combination of target word and free-association cue names a general concept or disposition; combination of target word and 


recall cue names a concrete manifestation, 


original partner of the recall cue head, the 
subject was only 20-30% successful. It is 
important to note that this 20-30% recognition 
rate is considerably lower than the approximate 
60% recall rate mentioned above. This is im- 
portant because it is the evidential basis for 
Tulving and Thomson's conclusion that non- 
recognized words nevertheless can be recalled. 

Tulving and Thomson used 48 words, of 
which rrcmr was only one, and of course the 
recognition and recall rates varied from word 
to word. My choice of LIGHT was purely ex- 
pository. But before I defend the proposition 
that the LIGHT example is representative of the 
whole set of 48 words, let me introduce my 
general conclusion by again using LIGHT. Very 
simply, the results schematized in the preced- 
ing paragraph may be summarized by saying 
that Tulving and Thomson’s subjects were 
pleased to pair LIGHT with head and LIGHT with 
dark, but reluctant to agree that the LIGHT that 
went with dark is the same as the LIGHT that 
went with head. In other words, Tulving and 


Thomson's subjects sensibly distinguished LIGHT 
(lamp) from LIGHT (luminance). True, neither 
LIGHT nor any of the other 47 words they used 
is as transparently homographic as, say, 
PITCHER (container) — PITCHER (baseball), but 
then subjects are considerably more subtle and 
flexible than researchers. If this were not true, 
the memory area would have become theoret- 
ically stable a long time ago. 

The set of materials used by Tulving and 
Thomson is reproduced in Table ly The three 
columns for each subset give corresponding re- 
call cues (like head), free-association cues (like 
dark), and target words (like Licut). To 
many of the target words I have added a super- 
script. Consider first those that have an c 
superscript. When each such target word is 
combined with its free-association cue, the com- 
bination names an abstract dimension of gen- 
eral experience. In contrast, when combined 
with its recall cue, the combination forms a 
specific denotative noun phrase. Thus, for ex- 
ample, white BLACK names à brightness con- 
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tinuum while train BLACK denotes a concrete, 
imageable thing. Much the same can be said 
about the target words with a b superscript. 
All of these, in conjunction with their free-as- 
sociation cues, name general concepts or dis- 
positions. When combined with their recall 
cues, however, they specify concrete manifesta- 
tions of those concepts or dispositions. Thus 
ache PAIN is a general disposition while beat 
PAIN denotes a specific agentive cause and its 
effect. The rest of the list is not so easily 
partitioned in categorical terms, but taken one 
by one, the same type of analysis can be made. 
For example, tobacco SMOKE must certainly en- 
tail a different representational encoding than 
drink swMoKE, the former suggesting something 
concrete like a cigarette or a pipe and the 
latter something abstract like a style of life. 

The point of all this is that all of the target 
words in the list are on a par with the LICHT 
(lamp) - LiGHT (luminance) case. Accord- 
ingly, Tulving and Thomson’s summary phrase 
“recognition failure of recallable words” is in- 
accurate. It is inaccurate because it states that 
the words that were not recognized are the 
same as the words that were recalled, when in 
fact they are not the same words at all. 

The target words in Table 1 are listed from 
top to bottom in increasing order of success in 
the recognition test (Tulving & Thomson, 1973, 
p. 363). Thus the BEING generated as a free 
associate to human was more likely to be recog- 
nized as the BEING that went with the recall 
cue exist than was the HIGH generated as a 
free associate to low to be recognized as the 
nicH that went with the recall cue hope. What 
this means is that the recall and free-association 
cues varied in the degree to which they primed 
similar semantic encodings of their target 
words. This result highlights a generally un- 
appreciated fact that homography is not a yes- 
no classification scheme, a point that recently 
has clearly been made by Anderson and his 
associates (Anderson & Bower, 1974; Reder, 
Anderson, & Bjork, 1974). Given that all 
words are ambiguous to some degree in the 
absence of a determining context, it follows that 
encoding specificity effects should be found just 
about everywhere (e.g., Tulving, 1974). 


ENCODING SPECIFICITY 


The encoding specificity principle may be 
stated as follows: 


What is stored is determined by what is perceived 
and how it is encoded, and what is stored deter- 
mines what retrieval cues are effective in providing 
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access to what is siored (Tulving & Thomson, 
1973, p. 353). 


The bulk of the earlier evidence for this prin- 
ciple (see Tulving and Thomson, 1973, for a 
summary) has been of the sort that if, at the 
time of input, the target word LIGHT has with 
it a normatively weak cue like head, then in a 
subsequent cued recall test head is a more ef- 
fective prompt than is a normatively strong sub- 
stitute cue like dark. Whether or not the 
normative strengths of association of head and 
dark with Licut have anything to do with cue 
effectiveness in this situation is another ques- 
tion. The point here is that at the time of in- 
put the subject is seen as storing LIGHT (lamp), 
hence he should certainly be better off in a 
prompted recall test if LicHT (lamp) rather 
than Lic (luminance) is what gets primed. 
And clearly this is what the above quote from 
Tulving and Thomson says. 

Note that the encoding specificity principle 
talks about encodings-what is stored-not about 
nominal stimuli and responses. In other words, 
note that the encoding specificity principle talks 
about LIGHT (lamp), LiGHT (luminance), 
PITCHER (container), and PITCHER (baseball), 
not about LIGHT and PITCHER. Indeed, this dis- 
tinction is fundamental to the whole idea of 
encoding specificity. Having noted this, it is 
obvious that the results of the three experiments 
reported by Tulving and Thomson are in com- 
plete agreement: Subjects are expected to re- 
trieve LIGHT (lamp) to the cue head but to 
resist identifying LIGHT (luminance) as LIGHT 
(lamp). 

What sense, then, are we to make of the 
summary phrase "recognition failure of recall- 
able words"? Does it mean that LIGHT is 
simultaneously recallable and not recognizable? 
Relative to the encoding specificity principle: 
the answer is ^no." This is because the encod- 
ing specificity principle does not talk about 
LiGHT, only about LIGHT (lamp) and LIGHT 
(luminance). Does it mean that LIGHT (lamp 
is simultaneously retrievable and not recogniz 
able? Again the answer must be “no,” this 
time because the experiments did not address 
the question. In fact, the experiments wer? s 
signed to guarantee LIGHT (lamp) in retrieve 
but LrcHT (luminance) in free association » 
recognition. In short, the summary , is 
uu. FS ` ” anti 
recognition failure of recallable wo” S je 
thetical both to encoding specificity 45 7 od 
ceptual notion and to the experimental p 
cedures whose results are justifiably taken 


demonstrate encoding specificity: 


f 
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THE ARGUMENT 


Tulving and Thomson’s attack on the genera- 
tion-recognition theory of recall exemplifies a 
phenomenon worth noting—namely, a shift in 
level of discourse. Before describing this shift, 
though, let me restate the feature of generation- 
recognition theory under attack. A retrieval 
process is presumed to generate a set of candi- 
dates, each of which is potentially translatable 
into an overt act known as recall. This set of 
candidates is then evaluated by a recognition 
process that sorts out the unacceptable candi- 
dates. Thus, first comes a retrieval process 
and then comes a recognition process. If LIGHT 
(lamp), say, passes through both of these staged 
processes, then the subject will say LIGHT. If 
the recognition criterion is set for LIGHT (lamp) 
and against LIGHT (luminance), the subject will 
say "no" if presented with a recognition test of 
LIGHT (luminance). 

Now, what about Tulving and Thomson's 
charge that nonrecognition of recallable words 
refutes generation-recognition theory? If by 
"words" they mean entities like Lic (lamp), 
then their experiments, by design, have not ad- 
dressed the issue, as I pointed out earlier. If 
by “words” they mean entities like LIGHT, then 
what they have done is dismissed a critical dis- 
tinction, a distinction of central importance to 
encoding specificity theory. This is a shift in 
level of discourse, a generalization from LIGHT 
(lamp) and ticHr (luminance) to just plain 
LIGHT. This is a curious shift in that both the 
encoding specificity principle and the experi- 
mental results they report argue that this type 
of generalization is wrong. Moreover, Tulving 
and Thomson have, by omission, improperly 
imputed this shift to generation-recognition 
theorists as well. Kintsch (1970) writes: 


The meaning of a formal unit . . . is defined by its 
relationship with other formal units. That is, a 
word does not have meaning in itself, but meaning 
in the system is given entirely by context... . The 
meaning of a word is known only if it is contrasted 
with the meaning of a related or opposed term 
(p. 354). 


Clearly, the unit of analysis here is LIGHT 
(lamp), not Licur. Thus what Tulving and 
Thomson have done, contrary both to their own 


position and to that of generation-recognition 
theorists, is move the whole conversation to a 
level that seems to entail dispute but in fact is 
vacuous. 

Consider the following three statements: (a) 
LIGHT (lamp) is retrieved and recognized, and 
the subject said ticur. (This is what gen- 
eration-recognition theory would say about a 
recallable word.) (b) riGur (lamp) was re- 
trieved, and LIGHT (luminance) was not identi- 
fied as LIGHT (lamp). (This statement is com- 
patible both with the encoding specificity prin- 
ciple and with generation-recognition theory.) 
(c) LicHT was recalled but not recognized. 
(This is the false summary phrase.) Tulving 
and Thomson experimentally substantiate b but 
use c in arguing against a. 

Undetected or unacknowledged shifts in level 
of discourse often give rise to irrelevant com- 
parisons and hence pseudodisagreements. I say 
"pseudo" because they are disagreements that 
in principle cannot be decided by experimenta- 
tion, no matter how ingenious. Resolution of 
such disagreements can come only by referen- 
tial analysis of the linguistic statements made 
by the contenders. In the present case, the con- 
clusion is that Tulving's encoding specificity 
principle, while in itself a cogent and properly 
influential concept, is currently either unrelated 
or complementary to the equally sensible view 
that recall is a staged combination of generation 
and recognition of response candidates. 


REFERENCES 


Anderson, J. R., & Bower, G. H. A propositional 
theory of recognition memory. Memory & Cog- 
nition, 1974, 2, 406-412. 3 

Kintsch, W. Models for free recall and recogni- 
tion. In D. A. Norman (Ed.), Models of hu- 
man memory. New York: Academic Press, 1970. 

Reder, L. M., Anderson, J. R., & Bjork, R. A. A 
semantic interpretation of encoding specificity. 
Journal of Experimental Psychology, 1974, 102, 
648-656. . 

Tulving, E. Recall and recognition of semantically 
encoded words. Journal of Experimental Psy- 
chology, 1974, 102, 778-787. 

Tulving, E, & Thomson, D. M. Encoding spe- 
Cificity and retrieval processes in episodic mem- 
ory. Psychological Review, 1973, 80, 352-373. 


(Received May 2, 1974) 


Psychological Review 
1975, Vol. 82, No. 2, 154-155 


Features and Semantic Memory: Set-Theoretic or Network Model? 
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This brief note attempts to clarify an issue recently raised by Smith, Shoben, 
and Rips concerning network versus set-theoretic models of semantic mem- 
ory. It is argued that their set-theoretic model can, without loss of explica- 
tory power, be formulated as a network model and that therefore the distinc- 


tion is vacuous. 
formulations. 


Smith, Shoben, and Rips (1974) have pro- 
posed a feature model of semantic memory. 
They contend that the model, in terms of the 
underlying structural representation, is a set- 
theoretic rather than a network model, follow- 
ing a distinction formulated earlier (Rips, 
Shoben, & Smith, 1973). I would like to take 
this distinction to task. My thesis is that their 
set-theoretic model can, without loss of explica- 
tory power, be formulated as a network model 
and that therefore the distinction is vacuous. 

Smith et al. (1974) state that "network 
models assume that words, or their counter- 
parts, exist as independent units in semantic 
memory connected in a network by labeled rela- 
tions" (p. 214; Rips et al., 1973, p. 2), while 
set-theoretic models represent concepts by sets 
of elements. In the Smith et al. (1974) feature 
model, the elements associated with a concept 
are the defining and characteristic features of 
the concept. 

It would be instructive, then, to formulate 
the Smith et al. (1974) model as a network. 
A network (Harary, 1969) is a digraph to- 
gether with a function which assigns a value to 
each edge of the digraph. The digraph, in this 
case, would need to consist only of ordered 
pairs of concepts and features. For example, 
the ordered pair (Ci, I's) would indicate that 
concept C: had feature Fy and would be rep- 
resented in the network by a directed edge from 
C: to F;. The value assigned to each edge 
would indicate the importance of a given fea- 
ture to the definition of a concept, and, based 
on some threshold value, the features associated 
with a concept could be partitioned into defin- 
ing and characteristic features. Given this 
definition, it is a trivial exercise to redraw, 
for example, Figure 1 of Smith et al. (1974, 
p. 216) as a network. 


Requests for reprints should be sent to James D. 
Hollan, Department of Social Sciences, Clarkson 
College of Technology, Potsdam, New York 13676. 


Some reasons are also given for preferring network 


The difficulties that Smith et al. (1974) see 
confronting network models are, by Smith 
et al.’s own argument, "due to their failure to 
include structural assumptions that would make 
category membership a matter of degree” (p. 
221). Since it is quite natural to formulate the 
Smith et al. (1974) model itself as a network, 
such criticism cannot be directed at network 
models in general but possibly can be leveled, 
as they contend, at specific network models 
(e.g, Quillian, 1969), although there is an 
argument (Collins & Loftus, Note 1) that even 
here the criticism is misapplied, resulting from 
a misinterpretation of Quillian's (1969) theory. 
More to the point, the argument against net- 
work models seems to be one which might 
better be put against digraph models in which 
no values are assigned to the edges, Even here, 
though, one could contend that it would be pos- 
sible to represent defining and characteristic 
features within a digraph by labeling the ap- 
propriate edges as defining or characteristic. 
But in this case, if one is to use the terms in 
their mathematical sense, technically one either 
has two digraphs, one imposed by the defining 
relation and another imposed by the character- 
istic relation, or a network in which the func- 
tion assigns nonnumerical values (for a dis- 
cussion of this latter, see Harary, Norman, & 
Cartwright, 1965, p. 363). 

A more general question is whether set- 
theoretic and network models are isomorphic: 
In the case of current psychological models, 
contend the answer is affirmative. One ean 
map any set-theoretical model onto a network 
model by following the procedure exemplifie 
with the Smith et al. (1974) model. The sets 
associated with a given concept need only us 
transformed into a collection of ordered pee 
in which the concept serves as the first ey = 
and the elements of the sets associated with oa 
concept serve as second elements. One pen : 
a relation, specified by the collection of order 
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pairs, and any relation can be represented by 
a digraph. The function associated with the 
digraph would assign a value or identifier to 
each edge to indicate which original set it was 
derived from. By reversing this procedure, 
one can map network models onto set-theoretic 
models. It is important to note that this argu- 
ment is not meant to apply to networks as the 
term is used in computational linguistics. These 
networks are models not of human memory or- 
ganization but of natural language grammars. 
They have evolved from finite state transition 
graphs in which nodes correspond to states in 
a finite state machine. An example of such a 
network is Woods’ (1970) recursive transition 
network which is equivalent, according to 
Winograd (1972), to grammars expressed as 
programs such as PROGRAMMAR (Winograd, 
1972). In these network models there are 
many aspects (e.g., recursion, conditional arcs, 
embedding) which, as far as I can see, could 
not be represented in a set-theoretic model. 

It remains to be shown that a network 
formulation of the feature model can serve as 
an adequate structural basis for the Smith et al. 
(1974) two-stage feature comparison model of 
semantic categorization tasks and thus support 
the assertion that the network formulation has 
equal explicatory power. In the processing of 
a pair of text items, the underlying structural 
representation, whether set-theoretic or network, 
is accessed only at one point: In Stage 1 the 
features associated with the items are retrieved. 
It is obvious that the retrieval of the features 
associated with a pair of concepts in a network 
is no problem. As each edge directed from a 
concept node is traversed, the value of the edge 
and the feature to which it is connected is 
obtained. 

Smith et al. (1974) make a strong case for 
the importance of distinguishing between de- 
fining and characteristic features of concepts 
and for their process model of semantic cate- 
gorization. It is evident that their model can 
be formulated either as a set-theoretic or as a 
network model. 'There are a number of rea- 
sons, though, that one might prefer to struc- 
turally formulate their model as a network. 
Chief among these are the ability to then make 
use of the substantial literature of graph the- 
ory, an area of mathematics concerned with 
the abstract idea of structure (e.g. traversal 
algorithms such as Quillian’s, 1969, breadth- 
first search or Bower's, 1972, depth-first 
search) ; and the ease (because of the nature 
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of digital computers and the availability of list- 
processing languages) of implementing graph- 
theoretical models as computer programs, a 
practice both common and valuable (Anderson 
& Bower, 1973; Hollan, 1974; Quillian, 1969; 
Rumelhart, Lindsay, & Norman, 1972). This 
practice is valuable not only because of the 
high degree of precision that such implementa- 
tion necessitates but also because it increases 
the likelihood of fruitful interchange (e.g., Quil- 
lian, 1969; Winograd, 1972) between cognitive 
psychology and artificial intelligence, two dis- 
ciplines currently very much concerned with 
the representation of knowledge. 


REFERENCE NOTE 


1. Collins, A. M., & Loftus, E. F. 4 spreading 
activation theory of semantic Processing (Report 
No. 2711). Cambridge, Mass.: Bolt Beranek 
and Newman, 1973. 
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The choice of a network or set-theoretic representation correlates with two 
substantive differences among models of semantic memory. First, in explain- 
ing certain reaction time effects, networks models emphasize retrieval pro- 
cesses, while set-theoretic models often emphasize comparison processes. 
Second, set-theoretic models posit more semantic analysis during sentence 


verification than do network models. 


Hollan's recent demonstration of an 


isomorphism between set-theoretic and network representations cannot resolve 


these important issues. 


In an earlier article (Rips, Shoben, & 
Smith, 1973), we characterized theories of 
semantic memory by means of a distinction 
between set-theoretic and network models. 
Set-theoretic models (eg., Meyer, 1970; 
Schaeffer & Wallace, 1970) were those in 
which the meanings of words were repre- 
sented by sets of semantic elements, while net- 
work models (e.g., Collins & Quillian, 1969; 
Rumelhart, Lindsay, & Norman, 1972) repre- 
sented meanings by nodes interconnected in a 
network of labeled relations. Since formulat- 
ing this distinction, we have witnessed two de- 
velopments that appear to undermine it. We 
noted one of these in a recent article (Smith, 
Shoben, & Rips, 1974) when we mentioned the 
development of hybrid models which contain 
both network and set-theoretic assumptions 
(eg., Collins & Quillian, 1972). The other 
challenge to our "set/net" distinction is Hol- 
lan's (1975) argument that set-theoretic repre- 
sentations can be translated into networks. 
Specifically, Hollan pointed out that a set could 
be equivalently represented by nodes (corre- 
sponding to the elements of that set), each 
connected to a common node. 

Although we accept Hollan's argument, we 
think it essential to point out the following: The 
choice of one type of representation rather than 
another can lead to substantive processing dif- 
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ferences between theories, even though the rep- 
resentations themselves are isomorphic at some 
level. (See Newell & Simon, 1972, chap. 3, for 
a striking example of this possibility. 
seems to us that this has been the case for 
semantic memory models. 

Thus, network and set models differ in the 
type of process they assume to be responsible 
for certain reaction time effects (Rips et al; 
1973). The effects in question occur in à 
paradigm where one must verify statements of 
the form An S is a P (eg, An apple is a 
fruit), and are such that verification time de- 
pends on the similarity of the subject-predicate 
pair. Current network models assume that 
these effects can be attributed primarily to re- 
trieval of pathways between concept nodes; in 
contrast, set-theoretic models like those of 
Schaeffer and Wallace (1970) and Smith et al. 
(1974) suppose that these same effects are de- 
termined by comparisons of semantic elements." 
This difference cannot be resolved by showing 
that set models can be translated into network 
representations. For suppose we adopt Hol- 
lan's (1975) proposal and allow semantic fea" 
tures to be connected by directed edges to S: 
concepts they define. We now have a netWo* 


; esses 
1It should be noted that comparison pres the 


are also required in network models, 5° com- 
relations between nodes in a network caP e while 
pared to the relations expressed in a sent? tl eoretic 
retrieval processes are also needed i" d is, 
models to access the sets of information: =; 
the essential difference between the mode type 
which type of process is postulated, bu Wert 
of process is generally assumed un 
critical similarity effect. 
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representation, but we still have to decide on 
the process that underlies the verification ef- 
fects of interest, either the retrieval of stored 
pathways between nodes or the comparison of 
the feature nodes of the predicate to those of 
the subject concept. 

A related substantive issue concerns the 
amount of semantic analysis that one must per- 
form to verify a statement. In network models, 
like that of Collins and Quillian (1969, 1972), 
it is sometimes possible to confirm a sentence 
like An apple is a fruit by retrieving a single 
link in memory. In contrast, set-theoretic 
models emphasize a fuller expansion of concepts 
in the course of sentence verification. For ex- 
ample, in the model of Smith et al. (1974), sub- 
ject and predicate concepts are always expanded 
into sets of semantic features before any fur- 
ther processing is done. Again we have a sub- 
stantive difference between existent set-theoretic 
and network models that is not elucidated by 
translating the former into the latter. For even 
if we do this, and construct a network repre- 
sentation of the Smith et al. model as Hollan 
does, we must still come to grips with the issue 
of how much semantic expansion to posit. 

These two issues seem to us to be important 
ones in present theories of semantic memory, 
and it is clear that sides taken on these issues 
were encouraged by the choice of a network or 
set representation. Partly this is due to a cer- 
tain type of representations being more amena- 
ble to certain types of processes. But apart 
from these substantive matters, there seems to 
be little reason to prefer networks to sets, except 
as a matter of theoretical convenience. While 
Hollan (1975) cites the literature in graph 
theory as a reason to prefer networks, work on 
set theory seems at least as adequate an under- 
pinning for semantic memory models. Indeed, 
some of the best-developed models within formal 
semantics depend on a set-theoretic framework 
(eg. Montague, 1973; Suppes, 1974). Also, 
implementation in terms of computer simula- 
tions can sometimes be as simple for set the- 
ories as for network theories, because of new 
computer languages (Raphael & Bobrow, 1974) 
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that use sets as well as list structures (though 
network structures may still be preferable when 
retrieval problems are an important considera- 
tion). To sum up, then, the choice between 
existent network and set-theoretic models re- 
mains a matter of substantive concerns. 
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Woods, Decke, and Vasseli recently pro 
mone ratio, I/GH, is the signal by w 


posed that the insulin - growth hor- 
hich body weight and feeding are 


regulated. Four major criticisms of the theory are discussed: (1) The 


I/GH ratio does not seem to have the p 


roper characteristics to be a regula- 


tory signal for feeding; (2) the correlation between I/GH ratios and feeding 
may have been prematurely elevated to causal status; (3) available evidence 
does not support the existence oí a direct relationship between the I/GH 


ratio and food intake; and (4) in terms 


theory can be generated from insulin levels alone. 


role of hormones in the control of feed; 


of parsimony, all predictions of the 
It is suggested that the 
ing is more likely to be understood 


in terms of modifications in cellular energetics rather than in terms of any 


direct signaling function. 


The precise manner in which mammals main- 
tain relatively stable body energy balance has 
eluded definitive identification. Recently, Woods, 
Decke, and Vasselli (1974) proposed a possible 
mechanism by which body weight and food in- 
take may be regulated. Essentially, they sug- 
gested that the regulation of body weight (or 
more precisely, adipose tissue) is determined 
by the relative secretions of insulin (I) and 
growth hormone (GH). When the circulating 
I/GH ratio increases, body weight and food 
intake increase; when the ratio decreases, body 
weight and food intake decrease. 

Although there is no doubt that both insulin 
and growth hormone are of cardinal impor- 
tance in the proper handling of body nutrient 
transactions, the theoretical import of the above 
proposal hinges largely on the manner in which 
changes in the I/GH ratio came to modify feed- 
ing. Without correlated changes in food intake, 
changes in the I/GH ratio could have only a 
small effect on body weight. In this reply, it 
will be argued that neither rational considera- 
tions nor available empirical evidence supports 
the idea that a change in the I/GH ratio is the 
antecedent condition for the occurrence of 
regulatory feeding behavior. Four critical ques- 
tions are posed: (1) What kind of regulatory 
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information could the I/GH signal mediate? 
(2) Are systemic changes in insulin and growth 
hormone causes or consequences of feeding? 
(3) Does available experimental evidence in- 
dicate any reliable relationship between the 
I/GH ratio and feeding? (4) Is the proposed 
theory parsimonious? The position taken 
herein is that the role of both insulin and 
growth hormone in feeding and weight control 
is to be understood simply in terms of the meta- 
bolic changes these hormones promote. In 
other words, both hormones constitute part of 
the regulatory outflow, rather than inflow, of 
systems which organize the maintenance of 
stable body energy balance. 


Tue BASAL Insuttn-Growrn Hormone RATIO 
AS A REGULATORY SIGNAL 


Woods et al. (1974) state that 


when body weight and the insulin- growth hor- 
mone ratio are out of alignment, regulatory eating 
Occurs, An increased ratio will cause an anima 
to eat much more than its usual daily ration unti 
a new appropriate weight is attained. If the rato 
is decreased, the animal will essentially stop eating 
until the new lower weight is attained” (P 7f 


Essentially, Woods et al. have elevated 1/GH 
ratio to set-point status, wherefrom it dictates 
the level at which body fat is to be mains 
via changes in regulatory feeding. But yt 
consider the types of information the 1/ int 
ratio could realistically convey. A puo in 
by its conceptual nature, must remain i 
the face of ongoing physiological and pena” os 
changes. Thus, the first problem: qu 

of both insulin and growth hormone Y 
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predictably with feeding, yielding the unlikely 
situation whereby a set-point fluctuates with 
the process it is supposed to be guiding. In- 
sulin is secreted primarily in response to food 
intake (Cahill, Herrera, Morgan, Soeldner, 
Steinke, Levy, Reichard, & Kipnis, 1966; Stef- 
fens, 1970), and growth hormone is normally se- 
creted several hours after meals (Hunter, 
Friend, & Strong, 1966) as well as during the 
first period of slow wave sleep each night 
(VanderLaan, Parker, Rossman, & Vander- 
Laan, 1970), Possibly because of these feel- 
ing-related variations in hormone release, 
Woods et al. stipulate that their ratio applies 
to “basal (fasting)” insulin and growth hor- 
mone levels. But even granting this restric- 
tion, problems remain. Though insulin does 
fall to very low stable levels in starved animals 
and humans (Goodner & Porte, 1972), the 
pulsatile release of growth hormone continues 
(Quabbe, Schilling, & Helge, 1966). If no 
momentary value of growth hormone is rep- 
resentative of fasting, the specification of a 
“basal (fasting)” I/GH ratio becomes a diffi- 
cult task. 

But even if it is assumed that an acceptable 
measure of the basal I/GH ratio exists, there 
would be problems in specifying how the regula- 
tory signal would operate from moment to 
moment. How can the basal I/GH ratio con- 
tinue to act as a set-point when it is no longer 
around—for instance, when the animal is no 
longer deprived? How does the basal I/GH 
ratio convey the set-point message through the 
nonbasal I/GH jungle? After all, the highest 
daily I/GH ratios do not correspond to periods 
of hunger, and the lowest ratios do not coincide 
with satiety. Since insulin is secreted in re- 
sponse, rather than prior to food intake (Han- 
sen & Johansen, 1970; Steffens, 1970), the low- 
est daily ratio corresponds to the urge to eat, 
and the highest ratio to the cessation of feed- 
ing. Of course, it could be argued that such 
paradoxes are not relevant to the theory, since 
the I/GH ratio only controls "regulatory eat- 
ing" not "caloric eating," but that would be un- 
satisfactory unless it is demonstrated why exist- 
ing data require two such concepts or how they 
are to be empirically distinguished. Such evi- 
dence was not presented. 

Conceivably the I/GH ratio could provide a 
stable reference set-point signal if both hor- 
mones were integrated across a considerable 
span of time, say a day. Such integration could 
readily transpire for low molecular weight 
steroid hormones which enter cells but not for 
heavy protein hormones such as insulin and 
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growth hormone which exert their metabolic 
actions at the membrane level (Cuatrecasas, 
1971). Prolonged membrane binding of these 
hormones could possibly occur somewhere in 
the body, but there is no evidence that such a 
mechanism exists. Alternatively, insulin and 
growth hormone may produce some intracellular 
"event" which is the integrated signal. But 
again. no such intracellular event, aside from 
cumilative energetic changes, is known or pro- 
posed. 


INSULIN - Growrn Hormone Ratio 
AND CAUSALITY 

A relatively good correlation between the 
average daily I/GH ratio, feeding, and body 
weight may well occur under certain conditions, 
The critical question is which is the leader and 
which is led. Undoubtedly, changes in circulat- 
ing insulin have to precede major body weight 
changes, for without insulin, nutrients cannot 
be deposited effectively in fat depots. How- 
ever, Woods et al. (1974) implicitly posit that 
the ratio antedates changes in feeding. In 
reality, the converse probably occurs—changes 
in feeding produce the striking changes in hor- 
mone levels. 

Sims, Danforth, Horton, Bray, Glennon, & 
Salans (1973) described the course of experi- 
mental obesity in men who voluntarily exceeded 
their normal daily caloric intakes for prolonged 
periods. As these individuals gained weight, 
their insulin levels increased, growth hormone 
levels decreased, and their cells became resistant 
to the action of insulin. When they resumed 
eating according to desire, hormone levels re- 
turned to normal Reductions in the I/GH 
ratio have also been described for obese ani- 
mals (York & Bray, 1973) and humans (Salans, 
Knittle, & Hirsch, 1968) during food restric- 
tion. Such data, from the vantage of the pro- 
posed theory, lead to the unlikely prediction 
that both enforced obesity and starvation would 
be self-perpetuating processes. The high I/GH 
ratios in experimentally obese human volunteers 
would tend to maintain further overfeeding, 
and the reduced I/GH ratios of the fasting 
obese individuals would reinforce reductions in 
feeding. Since this does not happen, it is likely 
that the hormone changes observed in most 
forms of obesity are the consequences rather 
than the causes of hyperphagia. 


EXPERIMENTAL EvipENCE CONTRARY TO THE 
Pnorosrp Rore or THE I/GH Ratio 


An argument for the significance of the 1/GH 
ratio might still be made if exogenous modifica- 
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tions of the ratio always produced the predicted 
changes in feeding behavior. Woods et al. sum- 
marize some evidence to indicate that such is 
the case. The most compelling support comes 
from the demonstration that exogenous insulin 
can produce a striking hyperphagia and weight 
gain in some species, most notably rats. It 
should be emphasized, however, that in other 
species, goats and sheep for instance, feeding 
is not increased by insulin (Baile & Martin, 
1971; Baile & Mayer, 1968). But even in rodents, 
the direct role of insulin in hyperphagia would 
be more compelling if experimental reductions 
of insulin secretion led to the opposite effect. 
In fact, chronic insulin insufficiency (diabetes 
mellitus) also produces hyperphagia (Kuma- 
resan & Turner, 1965), and this appears to re- 
flect real energization of feeding rather than a 
mere compensatory response to glucose lost 
in the urine (Panksepp & Nance, 1972). Acute 
diabetes produced with mannoheptulose also 
does not reduce feeding reliably (Panksepp, 
Tonge, & Oatley, 1972). though very high 
doses of anti-insulin serum can inhibit feeding, 
probably as a consequence of debilitating keto- 
sis (Anderson, Kilbourn, Robinson, & Wright, 
1963). 

It should also be noted that diabetic hyper- 
phagia is effectively reduced by treatment with 
insulin (Young & Liu, 1965). Thus a low 
I/GH ratio (i.e., chronic diabetes) yields more 
feeding than a much higher I/GH ratio (i.e., 
diabetic on insulin therapy). Furthermore, in- 
sulin-induced hyperphagia and weight gain can 
be prevented or reversed simply by concurrent 
administration of catabolic hormones such as 
dexamethasone ( Panksepp et al., in press). Such 
exceptions to the theory of Woods et al. are pre- 
dictable consequences of the metabolic effects of 
these hormones. In the diabetic animal, exoge- 
nous insulin again allows cells to store and use 
energy-yielding substrates. In animals treated 
with dexamethasone, the potential anabolic ef- 
fect of insulin is probably counteracted by cel- 
lular insulin-resistance (Akerblom, Martin, & 
Gary, 1973) and accelerated fat and protein 
catabolism. Since insulin antagonism is also 
present in individuals having high endogenous 
growth hormone levels (Merimee & Rabin, 
1973), it is not surprising that acromegaliacs 
have high basal insulin levels (Luft, Cerasi, & 

Hamberger, 1967) and that the abnormality is 
corrected when growth hormone levels are low- 
ered (Karam, Grodsky, Pavlatos, & Forsham, 
1965). "Thus, cited evidence in support of the 
theory, such as growth hormone injections 
counteracting the effects of insulin (Zierler & 
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Rabinowitz, 1963), is best explained by an- 
tagonistic metabolic effects of these hormones 
rather than by direct changes in I/GH ratios. 

Woods et al. (1974) also endeavor to docu- 
ment that reduction of systemic growth hor- 
mone increases feeding and that increases in 
growth hormone decrease feeding. Unfor- 
tunately, most of the published evidence does 
not agree with such a conclusion. The hy- 
pophysectomized animal is still the best avail- 
able preparation for determining how reduction 
of growth hormone affects physiological pro- 
cesses. Although considerable ambiguity rises 
from accompanying hormone imbalances, it is . 
clear that the most common effect of hypophy- 
sectomy on feeding is reduction of intake 
(Goldman, Schnatz, Bernardis, & Frohman, 
1970; Smith, 1927). Furthermore, hypophy- 
sectomy can prevent the hyperphagia and obes- 
ity which is normally produced by gold-thio- 
glucose-induced damage of the ventromedial 
hypothalamus (Redding, Bowers, & Schally, 
1966), reduce the obesity of genetically obese 
(ob/ob) mice (Herbai, 1970), and reduce the 
food intake of weanling rats with ventromedial 
hypothalamic lesions (Goldman et al., 1970). 
All of these reductions in feeding speak against 
the proposed theory. 

Changes in feeding after administration of 
exogenous growth hormone are also contrary 
to expectations. Feeding is increased by growth 
hormone injections (Andik, Sardi, & Schmidt, 
1966; Goldman et al., 1970 Pfaff, 1969) and 
growth hormone secreting tumors (Akerblom, 
Martin, Gary, & Moscarello, 1972), Of course, 
the concurrent weight increases can be recon- 
ciled with the proposed theory since growth 
hormone increases lean body mass at the ex- 
pense of adipose tissue. Still, if the I/GH ratio 
also controls feeding, the theory should predict 
that all relative increases in growth hormone 
would tend to decrease feeding, and that clearly 
is not the case. P 

In support of their theory, Woods et al. cite 
the work of York and Bray (1972) where daily 
injections of growth hormone markedly reduced 
weight of hypophysectomized rats with ventro- 
medial hypothalamic lesions. However, in that 
same study, growth hormone increased body 
weight of control animals, Since ventromedia 
hypothalamic lesions increase insulin secretion 
growth hormone injections should have E 
the I/GH ratio least in lesioned animals. ‘the 
accordingly, the controls should have ch 
most weight. This did not happen. Tot ae 
feeding and body weight can also be uem 
by growth hormone in hypophysector 
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weanling rats with medial hypothalamic dam- 
age (Goldman et al., 1970). 

In summary, no consistent support exists for 
the thesis that high I/GH ratios increase and 
that low ratios decrease feeding behavior of 
normal animals. In fact, most of the evidence 
summarized indicates that if a regulatory the- 
ory of feeding behavior were to be based on an 
I/GH ratio signal, it would probably operate 
in just the opposite fashion to that proposed by 
Woods et al. (1974). For instance, most of 
the reviewed data could be explained consist- 
ently if a genetically determined set-point ref- 
erence signal in the hypothalamus reflected a 
desired I/GH level, and that the actual circulat- 
ing level served as the error signal for regula- 
tory control of feeding. If the level were higher 
than the prescribed set-point level (as would 
occur just after eating), the animal should stop 
eating. Conversely, were it lower (as might 
occur several hours after a meal). the animal 
would tend to eat. But until there exists direct 
experimental evidence for any such mechanisms, 
all theories of this nature will be academic. 

Presently the simplest and most reasonable 
hypothesis concerning the normal role of in- 
sulin in feeding behavior is that it mediates 
satiety processes. Normally, insulin secretion 
occurs after feeding and allows absorbed nutri- 
ents access to intracellular metabolic machinery. 
Only in pathology might insulin begin to lose 
its satiating action by depleting blood nutrients 
too rapidly, thus hastening the passage of meta- 
bolic satiety, or because insulin receptors in 
membranes have become insensitive to the ac- 
tion of circulating insulin, 
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Finally, it seems that the proposed theory is 
not theoretically parsimonious. The evidence 
cited by Woods et al. (1974) does not require 
inclusion of both hormones for generating their 
predictions. Insulin would do alone; even 
growth hormone might. Since levels of the two 
hormones are usually reciprocally related in the 
body, very little additional predictive value is 
gained by a combination of the two. Of course 
reality is more complex than most theories, and 
the accurate depiction of the underlying physio- 
logical machinery may require equally complex 
explanatory constructs. But that is an em- 
pirical question, and no evidence was presented 
which unequivocally necessitates formation of 
a combinatorial insulin-growth hormone signal. 

But it should be emphasized that an inclusive 
theory of body fat regulation (ie, in health 
as well as pathology) is unlikely to be formu- 
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lated on insulin alone. Although it is widely 
recognized that basal insulin levels are directly 
related to the degree of obesity (Bjérntorp, 
Bergman, Varnauskas, & Lindholm, 1969; 
Rabinowitz, 1970), it is generally believed that 
insulin is, at best, permissive in the genesis of 
excess fat deposition. That insulin levels are 
not pathologically causal is indicated by experi- 
ments where insulin levels, body weight, and 
feeding are dissociated. Rapid weight gain can 
be produced with high fat diets with no in- 
crease in insulin secretion (Friedman, 1972; 
Lemonnier, Note 1). Adrenalectomy mark- 
edly reduces insulin levels of genetically obese 
mice while having little effect on their terminal 
level of obesity (Naeser, 1973).  Obesities 
characterized by increased number of adipo- 
cytes, as opposed to enlargement of fat cells, are 
usually accompanied by normal insulin levels 
(Bjórntorp, 1972). Additional dissociations 
between feeding, insulin, and growth hormone 
levels and adiposity have been reported in 
weanling rats with various combinations of hy- 
pothalamic lesions and diabetes (Bernardis, 
Chlouverakis, Schnatz, & Frohman, 1974; Gold- 
man et al., 1970; Goldman, Schnatz, Bernardis, 
& Frohman, 1972). 


CONCLUSION 


There is no question that body weight and 
feeding behavior will be modified by hormonal 
changes and that insulin and growth hormone 
will be among the main participants. Also, 
there is no doubt that levels of these and other 
hormones are modified in many forms of weight 
regulation disorders (see Bray and York, 
1971). It is even likely that high endogenous 
insulin levels contribute to the weight gains and 
possibly to the overeating of some obesities. 
These facts and ideas have been around for a 
decade. Woods et al. (1974) have provided us 
with a fine summary of some of the recent 
work related to these issues, but until data di- 
rectly implicates certain processes as being 
causal, there is little advantage to reifying se- 
lected correlations observed in the patholog- 
ically obese, or in assuming that such changes 
reflect the nature of normal regulatory mech- 
anisms. As pointed out recently by Bray, York, 
and Swerloff (1973), obesity can be positively 
correlated with adrenal, pituitary, ovary, uterus, 
and liver weight—even when the obesity is pre- 
vented by underfeeding. One might reason that 
these changes may be even more basic to the 
etiology of obesity than an elevated I/GH ratio 
—after all these factors remain stable when a 
fat animal’s body weight is forcibly reduced. 
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Correlations derived from pathological cases 
may obscure or caricature as well as sometimes 
clarify the normal functions of hormones in the 
body. Many hormones have biphasic effects 
as functions of both dosage and time. For in- 
stance, low levels of glucocorticoids can stim- 
ulate feeding (Grossie & Turner, 1965) while 
high levels produce profound decrements in 
weight and food intake (Stevenson & Franklin, 
1970). Injections of insulin which eventually 
stimulate feeding initially suppress feeding 

(Lovett & Booth, 1970). 

It presently remains likely that both insulin 
and growth hormone modulate feeding solely 
via their anabolic and catabolic influences on 
intracellular energy metabolism. The energy 
state of critical cells, probably in the hy- 
pothalamus, determines when feeding and sub- 
sequent spurts of hormone secretion should oc- 
cur. Furthermore, if it is the energy state of 
certain cells that acts as the error signal, then 
many of the metabolic influences which act on 
those cells would come to bear on the strength 
and direction of the resultant regulatory signal. 
Though insulin and growth hormone would un- 
questionably be among the major contributors 
to such a control process, they would not 
be the only ones, and their influence would 
probably be understood with reference to their 
effects on cellular energetics rather than respec- 
tive blood levels. No doubt there are many 
situations where hormone levels will fail to 
adequately reflect cellular energetics. For in- 
stance, in many forms of obesity, cells exhibit 
a marked resistance to the normal action of 
insulin (Bray & York, 1971), which may reflect 
decreases in the number of membrane insulin 
receptors (Goldfine, Kahn, Neville, Roth, Gar- 
rison, & Bates, 1973) or faulty transmission of 
the insulin signal to intracellular metabolism 
(Livingston, Cuatrecasas, & Lockwood, 1972). 
These changes themselves may be the conse- 
quence of pathological secretion of glucocorti- 
coids and growth hormone (Emmer, Gorden, & 
Ro, 1971). In such conditions, insulin secre- 
tion reaches abnormally high levels—probably 
reflecting a homeostatic endeavor of the system 
to increase transmembrane transport of nutri- 
ents. It is likely that the increased food intake 
shown by organisms with such pathology is 
simply another manifestation of the same ho- 
meostatic endeavor. This may well lead to 
good correlations between insulin levels, I/GH 
ratios, body weight, food intake, and probably 
a multitude of other factors. But in the ab- 
sence of any compelling evidence, the proposal 
of Wood et al., suggesting that such changes in 
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feeding are a direct consequence of one of the 
correlated parameters of body energetics (i.e. 
changes in the I/GH ratio) rather than a basic 
derangement of ongoing energy exchange, is 
gratuitous. Though pathology can often reveal 
the underlying principles of control systems, 
one must cautiously sift the causes of the pathol- 
ogy from the consequences. 

Clearly, hormones which are important par- 
ticipants in regulatory processes, need not in 
themselves be meaningful regulatory signals. 
Although it is possible that some hormone(s) 
will eventually be found to act as unique signals 
to central nervous system regulatory mech- 
anisms, until so demonstrated, it might be well 
to continue along traditional paths of seeking 
the role of hormones in feeding among the in- 
tracellular metabolic processes which are pro- 
moted or hindered. 
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there is no contrary data. In fact, evidence for 
insulin-sensitive sites in the central nervous sys- 
tem is becoming impressive (see Woods and 
Porte, 1974, for a review). (2) "Are systemic 
changes in insulin and growth hormone causes 
or consequences of feeding?" (Panksepp, 1975, 
p. 158). There is no doubt that food intake 
per se can change the I/GH ratio. In fact, 
we explicitly stated that "changes of the ratio 
can be induced by . . . forced variation in 
the amount of food consumed" (Woods et al., 
1974, p. 27). It should be emphasized that our 
theory of body weight regulation predicts only 
those episodes of eating which reflect discrep- 
encies between body weight and the I/GH ratio 
(ie. regulatory eating). Since most episodes 
of food intake, particularly in normal animals 
(ie “caloric eating"), are mot triggered by 
such discrepencies—but rather are external to 
the proposed system—they were not strongly 
considered. Panksepp repeatedly failed to make 
this distinction. It is well known that a variety 
of conditions, including environmental tempera- 
ture, exercise, palatability of diet, and social 
conditions of eating, to name a few, can influ- 
ence food intake and, depending upon their in- 
tensity and duration, can present challenges to 
the ideal (or prevailing) I/GH ratio. Any in- 
duced changes of the ratio, however, will be 
maintained only as long as the external condi- 
tions which alter feeding prevail. The fact that 
experimentally overfed obese subjects (e.g., 
Sims, Horton, & Salans, 1971) easily revert 
to their normal body weight (as well as the 
fact that naturally obese individuals easily re- 
vert to their high body weight) once allowed 
to free feed is not contradictory to our theory 
as Panksepp claimed. Our theory predicts that 
excessive food intake leads to an increase of 
the I/GH ratio and causes body weight (fat) 
to increase. This process, however, is not self- 
perpetuating as Panksepp claimed since the 
ratio will be increased only as long as the exces- 
sive food intake is maintained. Once this ex- 
ternal influence on insulin and growth hormone 
secretion is removed, the I/GH ratio declines, 
regulatory eating (fasting, in this instance) 
occurs, and the weight and the ratio are re- 
turned to the central nervous system-controlled 
levels. However, it seems possible that more 
chronic changes of feeding might eventually 
lead to functional adaptations of the secretory 
glands and consequently to more permanent 
deviations of the prevailing I/GIT ratio from 
its normal value. (3) "Does available experi- 
mental evidence indicate any reliable relation- 
ship between the I/GH ratio and feeding ' 
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(Panksepp, 1975, p. 158). The cited examples 
of chronic insulin insufficiency accompanied by 
hyperphagia are not valid as counterevidence 
to our theory since they are instances of caloric 
eating (to comply with energy requirements) 
rather than of regulatory eating (to bring 
body weight into alignment with the 1/GH 
ratio). In additon, Panksepp failed to mention 
that when insulin was lowered in all of the 
studies he cited, body weight fell in spite of the 
acute hyperphagia, exactly as our theory pre- 
dicts. Further, in the studies Panksepp cited 
in which growth hormone was administered 
and in which fat stores were analyzed, the fat 
stores actually decreased although lean body 
mass may have increased resulting in a slight 
increase in overall body weight. 

A second area of contention concerns the 
effects of hypophysectomy upon body weight. 
As we pointed out (Woods et al., 1974, p. 33), 
removal of the pituitary as a means of reduc- 
ing growth hormone is a crude measure at 
best because many other hormones secreted 
by the gland are simultaneously eliminated. 
The findings on animals with ventromedial 
hypothalamic ( VMH) lesions in addition to hy- 
pophysectomy, which Panksepp cited, therefore 
make a weak counterargument to our theory, 
especially since most of the data on such rats 
supports our theory. The VMH lesions de- 
crease the growth rate, especially in young rats 
(Bernardis & Skelton, 1966). It is therefore 
not surprising that VMH-lesioned hypophysec- 
tomized rats lost weight when injected with 
growth hormone and that hypophysectomized 
controls without VMH lesions gained weight 
(York & Bray, 1972). Most likely. the controls 
gained lean body mass such that the proportion 
of body fat decreased, as our theory predicts: 
Cox, Kakolewski, and  Valenstein (1968) 
showed that VMH-lesioned hypophysectomized 
rats gained more weight per gram of food s. 
take than nonhypophysectomized but VMI t 
lesioned controls. There is also evidence thes 
hypophysectomized animals have a greater pro 


tion of body fat than controls (Han, br 
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same conclusion that we have regarding insulin 
and feeding, stating that “The VMH appears 
to modulate feeding in response to changing 
levels of systemic insulin” (Panksepp & Nance, 
1972, p. 451). (4) "Is the proposed theory 
parsimonious?” (Panksepp, 1975, p. 158). 
Panksepp suggested that insulin alone (without 
growth hormone) provides as much information 
as the ratio and that the observed effects of 
growth hormone on body weight could be sim- 
ply explained by insulin antagonism of growth 
hormone (p. 158). We basically agree with 
this statement. However, we prefer the con- 
cept of an I/GH ratio because it is more easily 
operationalized than "functional" insulin. 

In his review, Panksepp also claims that 
there are many instances of dissociation be- 
tween the ratio and body weight, citing seven 
references. Of the seven, five are instances 
in which eating and insulin, not weight and 
insulin, occur in what Panksepp considers a 
strange relationship. It is because of situa- 
tions like these that we purposely distinguished 
between caloric eating and regulatory eating 
and focused upon body weight regulation. Of 
the remaining two citations, one is unpublished 
and cannot be evaluated. The final reference 
(Bjórntorp, 1972), according to Panksepp. 
states that certain obese states are accompanied 
by normal insulin levels. A careful reading of 
Bjórntorp, however, reveals that he reached 
the opposite conclusion, the one predicted by 
our theory (Bjérntorp, 1972, pp. 124 & 134).2 

Finally, let us return to Panksepp’s (1975) 
two major criticisms; that is, that the theory 
“hinges largely on the manner in which changes 
of the I/GH ratio come to modify feeding” and 
that “neither rational considerations nor avail- 
able empirical evidence supports the idea that 
a change in the I/GH ratio is the antecedant 
condition for the occurrence of regulatory 
feeding behavior" (p. 158). We discussed 
caloric eating above and it should be clear that 
caloric eating is not a necessary condition for 
changes of body weight. 

Many experiments show that the I/GH ratio 
can alter body fat in the absence of changes of 
food intake (e.g., Bernardis & Frohman, 1970; 
Frohman & Bernardis, 1968; Han, 1967 ; Hust- 
vedt & Lovo, 1972). Additionally, some more 


* This point is somewhat controversial since 
Bjórntorp (1972) hypothesizes that insulin and 
body weight might occasionally be out of align- 
ment, citing unpublished data as support (e.g., pp. 
123 and 135). All of the data presented by Bjórn- 
torp, however, agree with our theory. 
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recent research supports our conception of the 
control of regulatory eating. Although the 
evidence that a lesion of the VMH causes an 
increase of insulin that precedes hyperphagia 
and obesity was scant when the theory was 
first published, it is now well established. 
When the VMH is functionally lesioned in 
awake rats by use of the local application of 
a reversible anesthetic, insulin values are sig- 
nificantly elevated within 3 min in the absence 
of food intake, and they return to normal when 
the anesthetic wears off (Kulkosky, Porte, & 
Woods, in press). Further, two other recent 
studies have reported that the increase of insulin 
occurs when hyperphagia is prevented following 
VMH lesions to rats ( Martin, Konijnendijk, & 
Douman, 1974; Tannenbaum, Paxinos, & Bindra, 
1974). Finally, Hustvedt and Lovo (1972) 
showed that the hyperinsulinemia that occurred 
in rats with VMH lesions but restricted food 
intake could be used to predict their subsequent 
gain of weight given hyperphagia. Taken to- 
gether, these studies strongly imply that there 
is an increase of insulin following a VMH le- 
sion and that the magnitude of the increase de- 
termines the subsequent gain of weight and 
duration of hyperphagia, exactly as the theory 
predicts. Finally, Powley and Opsahl (1974), 
in an elegant recent experiment, showed that 
the obesity following a VMH lesion is elim- 
inated following section of the vagus nerves, 
the nerves which carry impulses from the cen- 
tral nervous system to the pancreas and in- 
fluence insulin secretion (see Woods and Porte, 
1974, for a review). 

In conclusion, although we, as well as Pank- 
sepp (1975), feel that something as simple as 
the I/GH ratio cannot account for all forms 
of body weight regulation, it does so better than 
other theories. Further, the fact that the large 
number of citations used by Panksepp to criti- 
cize the theory acually are compatible with it 
suggests to us that we may be on the right 
track. 
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A Balance Theory Interpretation of Dissonance 


Chester A. Insko, Stephen Worchel, Robert Folger, and Arūnas Kutkus 
University of North Carolina 


An argument was developed that the crucial cognitive band in the Festinger 


and Carlsmith paradigm situation is as follows: T. 
(?) arguing that the task is enjoyable (—). / 
evaluated and that the counterattitudinal ad 
issue becomes whether or not the subject wil 
an analysis of the meaning or meanings of the term ri 
cluded that responsibility will most lik 
aan B In this case, 
ae ana alanced. Such imbalance is then reduced by a cl 
jm t a iue act. Although such a change co! 
Eus s ject deciding that it is not too bad to mislead someone, 
id m achieved by the subject deciding that 
id be s since I really do believe that the ta 
ae is also leads to an explanation of Calder, Ross, 

, as well as to a prediction that a manipulation of inv 
opposed to reward—along with choice an 


interaction. This prediction was confirmed. Final 
as briefly sketched, along with an analysis 


choice and low reward. 


other forced-compliance situations w: 
of free-choice situations. 


In — 
ed e ee and Carlsmith intro- 
Paradigm fo as come to be the best known 
After Subien the study of dissonance. 
a LI an initial period work- 
ring task, they were offered 


estinger i Na condition. Thus 
Onsistent wi Carlsmith obtained results 
Bonobyio with one of dissonance theory's 
the ern Predictions, namely, the less 
inal advo used to induce counterattitu- 
“Range in ü acy the greater the attitude 
"he Susi direction of advocacy. 
Ulate al inger and Carlsmith study stim- 
arge number of subsequent investi- 


he self (+) is responsible for 
Assuming that the self is positively 
lvocacy is negative, the remaining 
1 accept responsibility. Following 
esponsibility, it was con- 


ely be accepted when there is both high 
however, the above cognitive band be- 


hange in the evalua- 
uld be achieved 
the change is 
“I did not mislead the 
sk is enjoyable.” The above 
and Insko's reinforcement 
estment of time—as 


d consequences would produce a triple 
ly, a balance explanation of 


Some of the studies used the same 


gations. 
In 


dull-task procedure and some did not. 
all of the studies, however, different levels 
of reward were associated with some form 
of counterattitudinal advocacy. The re- 
sults of these studies are seemingly incon- 
sistent. Some studies (e.g., Cohen, 1962) 
found the same inverse or negative rela- 
tionship between reward and attitude 
change found by Festinger and Carlsmith. 
Such an inverse relation has come to be 
known as a dissonance effect. Other 
studies (e.g, Elms & Janis, 1965 ; Rosen- 
berg, 1965), however, found a direct or 
positive relation between reward and atti- 
tude change, which has come to be known as 
a reinforcement effect. And still other 
studies (e.g, Melson, Calder, & mia 
1969) found no relation between rewar 
and attitude change. This large li 


terature 
has been recently 


and exhaustively re- 
viewed by Collins and Hoyt (1972). 
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TABLE 1 


MEAN ENJOYABLENEsS IN Hicn-CoxsEQuENCEs- 
Actors CONDITION OF THE CALDER, Ross, 
AND INsKo Srupva 


Reward High choice Low choice 
$ hour 21.33 9.60 
2 hours 14.80 16.20 


Note. Control actors = 13.20. All ratings on a 36-point scale, 
with higher numbers indicati; 


ng more enjoyment, 
* Adapted from Calder, Ri 


055, and Insko (1973, p. 91). 


Calder, Ross, 
tempted to determi 


reward (4 hi 
choice (hig! 
or low), and 


advocacy, 

quences condition the 
and also stated that 
was interesti 
and b 


Stooge w, 


1 as convinced 
Since th 


manipulated by 
with an Observ, 
Subject Observed 
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dition are presented in Table 1. These 
means are based on ratings of the task on a 
36-point scale, in which high numbers indi- 
cate the most enjoyableness. The means 
for the observer-subjects revealed a similar 
double-interaction pattern within high con- 
sequences. Control subjects who simply 
worked on the dull task and then rated its 
enjoyableness had a mean rating of 13.20. 
is control mean differed significantly 
only from the mean in the high-dissonance 
cell, which was 21.28. Although the vari- 
ables were operationalized somewhat dif- 
ferently, Collins and Hoyt (1972) obtained 
a similar triple interaction, in a study which 
further found that the greater amount of 
attitude change in the high-dissonance cell 
was still present on a 2-week delayed 
Posttest, 
al thesis of the present article 


ance theory (Heider, 1946, 1958) 
or affective-cognitive consistency theory 

Osenberg, 1956, 1965; Rosenberg & 
Abelson, 1960) Provides 


a framework that 
for the above dis- 
alts and indeed all dissonance 
ourse, that Heider, 
erg and Abelson did 
Consistency theories 
ather, they presented 
as general perspectives on the 
imbalance in Cognitive bands 
Cognitively linked attitude ob- 
Important problem of specify- 
elevant c gnitive bands for par- 
ations is largely untreated. It 
at the important cogni- 
Festinger and Carlsmith 
is one that involves the 

: CF) is responsible for (?) 
ng that the dull task is enjoyable 
this point the only sign that has 
© Positive one for the self- 
Se, explicitly assumes that 
€ subject holds himself 1” 
Although the assumption mà 
'Versally True, conimon sense ind a 
P Certainly true for the majorit 


: ce 
mphasis on the importan h 

and the Self’s Maintenance of a 

XPlanation of dissona? 


l, 
ronson (1968), N€ 


ories 
resolution of 


Le, two 
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Helmreich, and Aronson (1969), and Brown 
(1965). í 
^ Mist doux the sign of the second object 
€ m significance in the cognitive 
hio 4. vocating that the dull task is 
Eod n yi Counterattitudinal advocacy 
je ot always be unpleasant or negative. 
E. UNE INIMA be instances, in a debate 
Steen s in which counterattitudinal 
in ilm a un. We believe, however, that 
mk estinger and Carlsmith (1959) 
E oro d counterattitudinal advocacy is 
this rsen d to be negative. After all, in 
mislead : ion the subject does attempt to 
of peni ini that is, to lie. The degree 
too Sieh rs however, is probably not 
Ha which » a low-consequences condition, 
misleadi ue subject was not successful in 
ines ng the waiting subject. In a high- 
Eius nines condition, however, the de- 
extreme TEBET VOTES is obviously more 
Calder i Ls is particularly true in the 
ject not "es ( 973) study, in which the sub- 
also preve y misled the waiting subject but 
Upenmin ed him from studying for an 
condition s Withina high-consequences 
SOunterstir nen, we will assume that the 
Ae ot ütudinal advocacy is negative. 
(+) is ti band thus changes: the self 
the dull rc pe for (?) advocating that 
Sis on e t is enjoyable (—). An empha- 
Consequen importance of high or negative 
Nance i in the production of disso- 
1965) a reyonelg been made by Brown 
olling ro (1968), Nel et al. (1969), 
(1979), 8), and Cooper and Worchel 
al part of the cognitive band is the 
linkage, “is responsible for." In 
Osenber anguage this is a unit relation, in 
Nition, rh language an instrumental cog- 
ance of ne first emphasis on the impor- 
lissonan Sponsibility in the production of 
í nce of which we are aware is con- 
A an unpublished paper by Cooper 
à ng in " Coper’s paper is worth con- 
are pot S ome detail because its contents 
Paper ^s nerally known and because the 
Ssue, cusses what we consider a crucial 
Denn central argument is that 
the ar ility is a necessary condition for 
mai, 9USal of dissonance” (p. 64). The 
Conceptual problem that Cooper 
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confronts involves the relationship between 
responsibility and choice. Cooper argues 
that generally “choice is relevant to re- 
sponsibility when the act of choice serves 
to confront an individual with the responsi- 
bility of his own behavior" (p. 31). None- 
theless, Cooper points out that it is possible 
to have "responsibility without choice" 
(p. 45) and “choice without responsibility” 
(p. 531). Responsibility without choice is 
illustrated by the individual with high self- 
esteem who is aware that he possesses a 
negative characteristic, and also by some 
behavior which an individual recognizes as 
being a product of the unconscious. Choice 
without responsibility occurs when an indi- 
vidual makes a choice without foreseeing 
the consequences. Cooper discusses Brehm 
and Cohen’s (1962) well-known example of 
the farmer who decides to plant tobacco 
rather than corn and then sometime later 
learns that tobacco causes a deadly disease. 
According to Brehm and Cohen the farmer 
should experience dissonance, since choice 
and volition were present. Cooper, on the 
other hand, argues that the farmer will not 


erience dissonance, since he did not 


exp 
oice and 


foresee the consequences of his ch ; 
hence will not feel responsible. According 
to Cooper the crucial issue thus relates to 
the role of “foreseeability” in the produc- 
tion of dissonance. A previously reported 
experiment by Brehm (1959) had seemingly 
indicated that unforeseen consequences, OF 
a fait accompli, could create dissonance. 
Cooper argues for a different interpretation 
of the results. a 
In 1970 the issue was attacked empiri- 
cally by Brehm and Jones in the context of 
a free-choice situation. This experiment 
demonstrated the important role of fore- 
seen as opposed to surprise consequences 1n 
the production of dissonance. Subsequent 
similar research by Cooper (1971) and 
Worchel and Brand (1972) provided further 
evidence for the importance of responsi- 
bility. In the context of forced-compliance 
situations Collins and Hoyt (1972) have 
also argued for the importance of responsi- 
bility in the production of dissonance. 
If we return to the cognitive band, k ne 
self (+) is responsible for (?) advocating 
that the dull task is enjoyable (—), it is 
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evident from the perspective of balance 
theory that the subject will not wish to 
accept responsibility for the negative conse- 
quences. If he or she does so the Cognitive 
linkage becomes positive and, according to 
the multiplicative rule (Cartwright & 
Harary, 1956), the band is imbalanced. 

e assume that two general factors will 
this matter. First are 
two objects of affective 


and Schopler 
à cogniti 


» OF null 
à perceptual « y 
unity of the associated atti * 
and negative relations involy 
tendency to Work tow. 
associated Objects, 


not respo. 
ler furt] 
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doubtedly differ in the degree of pastva 
ness or negativeness, Thus the stronge 
the tendency to perceive unity or sameness 
between two associated attitude objects 
the stronger the positive cognitive relation. 

In terms of the present situation, We 
argue that it is important to consider kr 
degree of positiveness in the cognitiv 
relation between the Self and the conse- 
quences of the act. One way to decrease 
the positiveness of this cognitive relation 
is to increase the time and physical distance 
between the self and the consequences of the 


ERE Lat 
act. Thus it is commonly observed tha 
Push-button warfare 


deadly than hand-t d 
fact, more impersonal. With hand-to-han 


r 1e maximum shocks 
that Subjects would deliver to a victum 


declined with an increase in the “immedi- 
acy of the Victim," 


future contact, the 
between self and 
© Which the victim 
Course, no logical 
onsibility should be 
Positiveness of the 
ut we believe the 
theless true, As will 
; responsibility is an 
vague cognitive relation. 
ity o confusion with other 
lS is fairly high. pat 
nt, the į int is th 
t, emarkabje number of einen Point is may 
San individual's denial of respons!” 
| negative Consequences, Coop* 
“Iscusses the fact that numero" 
sts, Philosophers, and write! 
lé general tendency 4 
.uPOnsibility for “action” 
Strating, uncomfortable 


x 4) 
. 2 Negative C " (p. 
Te Points out t Onsequences" ( 


:der$ 
enia] of . ^ Horney (1950) consid, 
aspect E 9 esponsibility an import? t 
Of neurotic Personality develop? 


Ne extent t 
There is, of 


. assumed resp 
affecteg by the genera] 


H ation, 
8eneralization ; 
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and that Fromm (1941) makes the same 
pomt regarding normal personality develop- 
Pu romm’ notion of "escaping from 
onum ectly involves a denial of 
e5 stenti] Es Y ooper also points out that 
Sartre ee quain such as May (1958) and 
(Mata x 2 na toa similar conclusion. 
fles: 8 to À artre, most of the time, we 
ne anguish in Bad Faith" (p. 95). It also 
fie out that Shakespeare had the same 


m 
Fools by 
spheric, hr heavenly compulsion; knaves . 
of SES predominance. . . . An admirable evasion 
to the cl Taster man, to lay his goatish disposition 
harge of a star!" (King Lear, 1, ii) 


I koR 
Sire appears that most people in 
bility for y ia reluctant to accept responsi- 
Done Ire consequences. From our 
intriguing deu this reluctance is 
ner, | ae it follows from balance 
tive conse iming a positive self and nega- 
bility y quences, acceptance of responsi- 

A would create imbalance. 

vet Coane that circumstances, as 
cept res we he, an individual to ac- 
in such P mer jility. As previously noted, 
imbalance i os the cognitive band is 
T'esponsibili: Assuming the evidence for 
balance ied is sufficiently compelling, 
regard Pew: be restored by ceasing to 
regard the Sel as positive or ceasing to 
tive. Sin counterattitudinal act as nega- 
Would re : lowering one s self-esteem 
Zation atia a rather massive reorgani- 
attitudes io. self-concept and associated 
evaluation i we regard a change in 
More Jikel ° zu counterattitudinal act as 
»elson rae E HI assumption Js similar to 
east effort i osenberg s (1958) principle 
na Cheers a the restoring of consistency 
How un PM arena or attitude structure. 
evaluation "i subject change his or her 
Ossibly de the counterattitudinal act? 
NOt tog bad subject could decide that it is 
“Onclusi Qe b mislead someone. Such a 
More however, is not very plausible. 
that ,P'ausibly, the subject could decide 
Pecause | id not mislead the other subject 
a chan really did enjoy the task." Such 
espere Would eliminate imbalance in the 
i mier band involving the self. Also 
T cognitive band is implicitly in- 
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volved: "My advocacy (+) pointed out 
that (+) the task is enjoyable (+).” 
From a balance theory perspective, how- 
ever, imbalance-resolving change in the 
enjoyableness of the task is dependent upon 
the subject's acceptance of responsibility 
for the counterattitudinal act. But what do 
we mean by the term “responsibility,” or, 
more exactly, how do subjects in the Fest- 
inger and Carlsmith (1959) situation con- 
ceive of responsibility? At this point, it 
may be useful to review briefly Heider's 
(1958) five "levels" of responsibility. 
Heider's levels differ both in the restrictive- 
ness of the implied definition of responsi- 
bility and in the extent to which responsi- 
bility is attributed to the environment 
rather than to the person. Fishbein and 
Ajzen (1973) label and describe these levels 


as follows: 


At the first and most primitive 
held responsible for all effects that 
are in any way associated with him. (2) Commission: 
At the next level he is held responsible if he was 
instrumental in producing the observed effects (even 
if he could not have foreseen them). (3) Foresee- 

At this level the actor is held responsible 
if he could have foreseen the effects even 
not have intended to produce 
At the fourth level he is 


(1) Association: 
level, the actor i 


abili 
only 
though he might 
them. (4) Intentionalily: 
held responsible for effects he foresaw and intended. 
(8) Justification: Finally, at the fifth level he is 
held responsible only to the extent that his in- 
tended behavior was not justified, i.e., not caused by 
environmental factors beyond his control (pp. 


149-150) 


On the fifth and most restrictive level, 
responsibility is more likely to be at- 
tributed to the environment than to the 
person. At this level Heider has what 
might be characterized as a “variance ac- 


counted for" definition of responsibility. 
ircumstances, 


Given certain environmental c 
if almost everyone acted in the same way, 
the individual in question is not responsible. 
According to Heider, it is at this level that 
"the criminal may blame the environment 
for his ill-fated career and thereby excuse 


himself" (1958, p. 114). 

It strikes us as quite likely that the 
typical subject in the Festinger and Carl- 
smith (1959) situation initially Pape see 
of responsibility as à confused digethre : 
the various levels, that is, as à fairly genera 


positive cognitive relation. However, since 
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the first three levels will not enable the 
subject to deny responsibility and avoid 
imbalance, it is probable that one or both 
of the last two levels are quickly arrived 
at. Whether or not there is any kind of 
sequential shift, we will assume that the 
typical subject mainly conceives of responsi- 
bility in terms of a mixture of intentionality 
and variance accounted for, that is, Levels 
4 and 5. 

We are not in a position to ask what 
causes someone to accept responsibility 
for negative consequences. Theoretically, 
high choice provides evidence for some 
intentionality and also indicates that every- 
one might not have acted in the same way. 
Conversely, low choice provides evidence 
against intentionality and also indicates 
that everyone might have conformed to 
the pressure and acted in the same way. 
"Therefore, choice conveys evidence for 
responsibility, regardless of whether re- 
Sponsibility is conceived of in terms of 
intentionality or variance accounted for. 
Is choice enough, however? 
clined to think not. 
Situations where the 
tween desirable alter 
that choice per se p 
cognitive glue. Although this assertion is 
Somewhat at variance with the emphasis 
traditionally placed upon choice, it is inter- 
esting to note that Calder et al. (1973) did 

ain effect for choice. 
consequences nor the low- 
as there a general 
h choice to differ from 
ation with some 
ance Or circumst: 


We are in- 
Except in free-choice 
subject chooses be- 
natives, we believe 
rovides rather weak 


: ances 
may, however, force acceptance of responsi. 
bility for negative consequences. These 
additional circumstances m 


1 1 ay include the 
expenditure of high effort Preparing for or 


creating the negative Consequences or the 
toleration of threatened or actual punish- 
ment in order to create the negative conse- 
quences. The results of Calder et al, 
indicate that the additional circumstance in 
the Festinger and Carlsmith (1959) situ. 
ation is reward. 

Reward has three functions i 
Festinger and Carlsmith situation, 
Teward provides evidence 


n the 
First, 
regarding the 
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extent to which other subjects would 
comply with the request and argue that a 
dull task is enjoyable. When the rewar 

is large, it is plausible for the subject to 
assume that other subjects would have 
behaved in the same way. Therefore, - 
the perspective of the variance-accounte i 
for definition of responsibility, a Im 
reward provides evidence for the denial o 
responsibility. Also, when the reward is 
small, it is not plausible for the subject to 
assume that other subjects would have 
behaved in the same way. A small reward 
thus provides evidence for the acceptance 
of responsibility. This may be true even 
in the case in which the consequences are 
partially unforeseen, that is, when there is 
only partial intentionality. As long as 
some of the consequences are foreseen, 
the occurrence of further unforeseen conse- 
quences may not, given the free choice of 


a small reward, deter the acceptance of 
responsibility, 


The second function of 


reward is as 
evidence of the 


experimenter's concern with 
the experiment and thus his desire or in- 
tention that the waiting subject be misled: 
We believe that subjects arc less likely to 
use a variance-accounted-for definition of 
responsibility than a definition in terms © 
one of the other four levels when they 
focus on the experimenter, This is partially 
because they are not motivated to be a$ 
analytic regarding the experimenter as they 
are regarding themselves, and partially 
because they have no evidence regarding 
how other experimenters would act in the 
Same situation, Thus, when the reward E 
large it may be quite plausible to point 5 
finger at the experimenter and indirect e 
absolve the self of responsibility. On t? 
other hand, when the reward is low a" 
insufficient such a shift in responsibility p 
much less compelling. Indeed the V@: 
minimal nature of the reward is some e 
dence that the experimenter did not CP 
undue Pressure on the subjects and 1s off 
overly concerned with the experi’ e 
Thus the Jow reward makes it less plaus! j$ 
to believe that “the nte! 
responsible,” 

The third fu 
Festinger 


experime 
1 che 


. i 
inction of the reward jig 


and Carlsmith (1959) par 


v 


| 
| 
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is as a cue to the enjoyableness of the task. 

If the subject agreed to argue that the task 

was enjoyable in return for only a low or 

minimal reward, the reward becomes a pos- 
sible cue that the task was in fact enjoyed. 

Otherwise, how is the compliant behavior 

to be explained? A high reward, of course, 

carries no such cue value. The cue impli- 
cations of low reward in the high-dissonance 

(high-choice, high-consequences) cell are 

Particularly welcome since acceptance of 

the implications allows the subject to re- 

solve imbalance in the cognitive band. 

A above line of argument is in some 

pects reminiscent of Bem's (1967) attri- 
bution interpretation of dissonance effects. 

We, however, do not endorse his somewhat 

extreme position regarding the equivalence 

of actor and observer attribution and the 
eee portance of prior attitudes. Some 

Scent research by Ross and Shulman 

cca n fact, calls into question Bem's 

(et p the importance of prior attitudes 

other ‘tae Se! McConnell, 1970). On the 

preted hand, if attribution theory is inter- 
thronot xe mean that the subject goes 
te oe, an inference process, perhaps even 
halta fie ol gerne process (Helm- 
iy a i then what we have said 
hag y compatible with at least some 
thate bo attribution theory. The fact 

study erven in the Calder et al. (1973) 

action Exe able to track the triple inter- 

the perp eee certainly does not rule out 
since b hs inferential processes. Actually, 

Were d ^ balance and attribution notions 
hardly s : orth by Heider (1958), they are 

Strange bedfellows. 

Ricans tific we are able to account for 
bust ude change that occurs in the high- 
onae” iu or "Oh my God what have I 

study E. of the Calder et al. (1973) 
; either de 1e remaining three cells of Table 

Or both Ki was low or reward was high, 
theoretical T in these cells the subject 
to resolve y should have been more prone 

bility for SUME by denying responsi- 

Should pe E e negative consequences. It 
lisonanes Mey noted that only the high- 

kx Ficus I dien from control. 
the hi ise complicating factor relating 
gh-dissonance cell involves the 


lensi 
o . 
n or nervousness that subjects may 


have felt as a result of having prevented the: 
waiting subject from studying for the quiz. 
Although the quiz was unforeseeable, in- 
formation regarding it did make the conse- 
quences even more negative. On the other 
hand, the information regarding the quiz 
may have also produced what Brock (1963) 
calls “irrelevant tension.” Brock found 
that “spreading of the alternatives” in a 
free-choice situation was more pronounced 
among children who had cheated in the im- 
mediately preceding situation and thus were 
more nervous and upset. There is therefore 
some reason to suppose that tension and 
nervousness act as a catalyst facilitating in- 
consistency or imbalance-resolving changes. 
If this is true the magnitude of the results 
in the Calder et al. (1973) high-dissonance 
cell may have been partially a function of 
such “irrelevant tension.” 

By way of restatement, we can regard the 
subjects in the high-dissonance cell as 
going through an inference process. While 
such a process may involve any of a number 
of steps or orders, one possibility is as 
follows. First, the subject confronts the 
negativity of the consequences and counter- 
attitudinal act. Second, since acceptance of 
responsibility would produce imbalance, 
there may be initial consideration of re- 
sponsibility denial. Third, in view of the 
strong evidence for responsibility flowing 
from the joint existence of high choice an 
low reward, responsibility is tentatively 
accepted, thereby creating a potential or 
actual dilemma (imbalance). Fourth an 
finally, imbalance is avoided or reduced by 
the subject's deciding that the task 1s 
enjoyable and that the waiting sube 
was not misled. Other sequences are, o 
course, possible, but we nonetheless believe 
that the subject does go through an A 
ferential process of some sort. Quite 
possibly the process occurs with consider- 
able rapidity. 

To this point we h 
explanation for everything 
by dissonance theory in the 
(1973) experiment. It is only by V" 
its absence that dissonance theory 1s re 
vant to the other three cells of the design. 
Dissonance theory, for example, does not 
attempt to account for reinforcement 


ave offered a theoretical 
that is explaine 

Calder et al. 
virtue of 
le- 
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eflects. It is true that none of the three 
remaining cells of Table 1, including the 
two reinforcement-effect cells, differs from 
control. Nonetheless we feel that they 
deserve explanation. The two reinforce- 
ment-effect cells differ significantly from 
each other, and, as previously indicated, 
other investigators have found reinforce- 
ment effects. What follows, therefore, is 
an attempt to account for the addition 
between-cells variance in Table 1. This 
explanation is based on a postulated se- 
quential-inference process. 

We will begin by noting that Calder et al. 
(1973) only obtained à reinforcement effect 
in the high- or negative-consequences con- 
dition. In the low-consequences condition 
the low-choice means are actually reversed 
in direction, although the reversal is small 
and nonsignificant. Thus, contrary 
common sense expectations, h 
tive consequences are 
Prediction of Calder et 
effect. This in turn sug: 
reinforcement effect is 
the subject’s denial of 


responsibility. 
We consider first the low-reinforcement 
Cell. Since choice wé 


as low, the evidence 
for the subject's assumption of responsi- 
bility is not Very strong. Nonetheless, the 
Situation is Somewhat equivocal in view of 
the low reward. As we noted Previously, 
low reward makes it difficult for the subject 
to shift Tesponsibility to the experimenter 
and to believe that other subjects would 
have acted in the same Way. "Thus the 
Subject may for an instant consider 
cepting responsibility, 
the subject will be forc 
imbalance to consider the negativeness or 
awfulness of the counterattitudinal act. 
In view of the negative Polarity of the 
counterattitudinal act, the partial unfore- 
seeability of the cor 


nsequences, 
of high choice, however, the 


out of the dilemma (imbalance) by denying 
responsibility. The subject May reason 
that other subjects would have been pres- 
sured into doing just what he or she did. 
Thus what we are postulating is 
tial-inference Process, in which 
bility is first considered 
Further, we believe th 


al 


to 
igh or nega- 
important in the 
al.’s reinforcement 
gests to us that the 
somehow related to 


ac- 
During this period 
ed by the potentia] 


and the lack 
subject gets 


à sequen- 
responsi- 
and then rejected, 
at such Sequential 
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inference results in an even more negative 
evaluation of the counterattitudinal act. . 

There are two possible reasons why this 
is so. First, as a result of the subject s 
having considered acceptance of responsi- 
bility for the counterattitudinal act, a 
negativity of that act becomes particular y 
salient. Second, by assuming that his oF 
her counterattitudinal act is pied 
negative, the subject can account for an 
strong denial of responsibility. The subjec d 
in effect, reasons that “since I am avoiding 
something it must be pretty awful. 
What we are postulating is a type of s 
tribution based not on overt behavior, but 
on a newly formed belief or cognitive 
linkage ("T am not responsible" or Ies 
not my fault”). If the subject regards the 
counterattitudinal act as highly negative, 
then the fact that the counterattitudinal 
act is positively associated with the task 
indicates, according to the multiplicative 
rule, that the task should be regarded 
negatively, This then accounts for the 
low mean of 9.60 obtained in the low- 
reinforcement cell, 

The situation in the hig 
cell is much less threate 
Since there is b 


h-reinforcement 
ning for the subject. 
oth low choice and high 
reward, there is even less reason for the 
assumption of responsibility, The subject 
thus may not be Severely threatened with 
the Possibility of Personal responsibility for 
the negative Consequences. As a result, 
the Negativity of those consequences may 
not be salient, and the subject’s denial of 
responsibility may be less forceful. A less 
vehement denial of responsibility would in 
turn have less of the attribution implication 


x TA 5 : t 
that “since [ am avoiding something it mus 
2€ pretty awful,” 


© same argument made for the low" 
reinforcement cell could also be made f^! 
the Ow-dissonance cell. In the late 
Instance, however, the threat of respons! 
bility Would come from high choice. 

To recapitulate briefly, we have arguo 
that the Festinger and Carlsmith à 
paradigm can best be viewed from the P 
Spective of a Cognitive band: The self t 
IS responsible for (?) advocating um o 
dull task is enjoyable (2). Under hig? i 
negative Consequences the countera 


A BALANCE THEORY INTERPRETATION Or DISSONANCE 


tie advocacy becomes negative. In the 
Wie issonance cell the combination of high 
a lom reward forces the su bject to 
creating Nee A responsibility, thus 
potential ar 2 s cognitive linkage and 
aie ale ‘satel imbalance. The low 
the task eee ts as a cue, indicating that 
resolved Mya enjoyable. Imbalance is 
Salut, r avoided by a change in the 
the ved Or the task, thereby allowing 
WES ade pee that the other subject 
iria ie = and that the counterattitud- 
NER rete = “yas not negative. In the 
lá f tien cell, where there is both 
MSS low reward, the threat of 
lends the ity Ven by the low reward 
negativity z pp first to consider the 
cacy ves a the counterattitudinal advo- 
bility. T} "i to forcefully deny responsi- 
negative ee heightened salience of the 
high pop ration paw advocacy (or 
ine ge e: plus the attributional 
SPonsibility s No forceful denial of re- 
zation of ud ea s, via a balancing generali- 
ation of ibo Ets to a more negative evalu- 
F e task, 
reader we ne hope of not straining the 
We will re E ity to the breaking point, 
Speculation var ourselves from further 
affective va] concerning the role of the 
ation Seg us of high reward. Such specu- 
Presentation vr more appropriate after 
eXperim n. 9 the data from the present 
T) ent, 
Povide in purpose of the experiment is to 
intere ee test of the balance 
results, Si - of the Calder et al. (1973) 
Sented hs. gus this interpretation Is pre- 
Nitive Filii of rapidly occurring CO8- 
test cessing, a definitive and direct 
test į The present 
Scare 
ation, 
Similar 


is diffic : 
$ difficult to achieve. 


Sub; 
. JJect's. z : pap 
Simi] assumption of responsibility and 


3 vari P 

Was ini is the investment of time, which 

and ioe to us by Wicklund, Cooper, 
inder (1967) 

; ot ft h i 

Jects o Wicklund et al. experiment sub- 
anticipated waiting either 1 minute or 
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15 minutes for exposure to a counterat- 
titudinal message. Waiting 15 minutes 
required that the subjects remain longer 
than the 30-minute period to which they 
had initially agreed. All subjects were 
tested under high-choice conditions, and 
there was no manipulation of consequences. 
None of the subjects actually waited for the 
exposure; the manipulation was of antici- 
pated waiting. The results indicated 
significantly more agreement with the anti- 
cipated message in the 15-minute than the 
1-minute condition. Wicklund et al. con- 
sidered the anticipated-waiting manipula- 
tion to be one of anticipated effort. Cer- 
tainly, however, time relates to effort only 
in a metaphorical sense. But beyond this, 
the crucial aspect of the Wicklund et al. 
manipulation may relate to the fact that 
the 15-minute subjects were asked to wait 
longer than they had originally agreed to 
wait. They were thus asked to invest more 
time. For that reason we would prefer to 
regard the manipulation as an investment 
of time or an anticipated investment o! 
Admittedly, however, even this terminology 
t reflect the fact that there was a 
15-minute 


f time. 


does no 
violation of expectations for the 
subjects. 

It was argued previously that reward has 
three functions in the Festinger and Carl- 
smith (1959) paradigm. First, reward 
provides evidence regarding the extent to 
which other subjects would comply with 
the request and argue that the dull task 
is enjoyable. Second, reward provides 
evidence regarding the experimenter's con- 
cern with the experiment and thus his 
desire that the waiting subject be misled. 
Third, reward is a cue to the enjoyableness 
of the task. We believe that the first and 
third of these functions are also true for 
investment of time. When the investment 
of time is high, it is not plausible for the 
subject to assume that other subjects would 
have behaved as he or she did. High 
investment of time thus provides evidence 
for the acceptance of responsibility. Also, 
if the subject agrees to argue that the task 
is enjoyable even when high investment of 
time is required, the investment of time 
becomes a cue that the task was in fact 
enjoyed. Otherwise, how is the compliant 
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behavior to be explained? We therefore 
predict the same triple interaction that was 
obtained by Calder et al. (1973). The 
“dissonance” effect for investment to time 
should occur when there is both high choice 
and negative consequences. Note that 
this prediction is, superficially at least, in 
disagreement with the results of Wicklund 
et al. (1967), since they had n 
tion of consequences. 

Beyond this, however, investment of time 
has one rather salient difference 
reward. Low investment of time, 
high reward, does not have any 
value. Thus any difference bet 
present data and that of Calder 
provide some circumstantia] 
regarding this matter, 


o manipula- 


from 
unlike 
affective 
ween the 
et al. will 
information 


METHOD 
Subjects 


The subjects were 126 female undergraduates en- 
rolled in the introductory Psychology Course at the 
University of North Carolina Who had Signed u 

for a “Measures of Performance” experiment, Date 


from 12 of these subjects were not included in the 
analyses, One subject (low choice, 


quences, and investment of 
in the experiment, 
either giving the subject D tei 

subject a. choice as to wh "E P5 Stving the 
the waiting subject that the task wae Would tel] 
The consequences Variable was manipulates yale. 
same way as Calder et al, In rat the 


(choice, conse- 
ere manipulated 
s manipulated by 

o 
ether or 


consequences Sh 
skeptical and then became e ul iban (SI 
periment was intere enjoyable. 
stated that since the experiment 
would stay rather than leave 
he had initially intended, 
condition the stooge remaj 
convinced. The investme 
Was modeled after the 


nt of time mani i 
Wicklund et al. 
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manipulation—the main difference being shat: Oe 
subjects actually did wait differing amoun it 
time rather than just anticipating a given MATS 
In the high-time condition the subjects had to wait 
minutes before they could meet the next subject, e 
this extra wait meant that they would not be throug! 
until ten minutes after the end of the time eee 
for which they originally signed up. In the Jod 
condition the subjects only had to wait 1 min xt 
while the experimenter checked to see if the ana 
subject had arrived. Subjects in this peur i 
expected to be finished within the time for whi 
they had originally signed up. vd by 
€yond the eight groups or conditions create ey 
the above factors, three additional groups a 
tested. One of these was a standard control grot i , 
After working on the dull task, subjects ne 
filled out the final questionnaire, There was ent 
manipulation of choice, consequences, or invested 
of time. The other two groups were tested ME rel 
high-time-anticipation, high-choice, no-consequen’ 
conditions or low-time-anticipation, high-choice, ! ne 
Consequences conditions. After working on bx 
task, subjects were given a choice as to whether 1 
not they would Wait (either 1 or 15 minutes) pe 
talk to the other subject. They were then given Mw 
final questionnaire, "There was no manipulation he 
Consequences, The intent was to see whether Tol 
anticipation of high or low investment of time, e 
Choice, and an unmanipulated anticipation of TOE 
Wences would Produce results similar to those 
icklund et al. (1967), trol, 
All of the above groups, except for the com je 
Sg tained 9 subjects, The control group conta! 


Procedure 
" he 
Since the Procedure so closely paralleled "S 
iCal and script of the Calder et al. ed 
p Y: a detailed description will not be FOP Ate 
mainly parture from the Calder et al. m op” 
» occu: i t ime (a5, 
Posed to rred for the investment of tin 


+t time 
reward) manipulation. In the high-t 
Subjects were told, 


at 
š n Apr 
have one male subject signed Uf 


ich 
miss end of the half-hour period " pur 
ject had spur m 30: fot 
Who had Started ee dup; eg. 12 


„ data 
at 12:00), but I've got some € 


Won't be able to start until ime: 


= 
rk on and 


ee -eyond the end of credited t 


3 33 " uld 
through That means that you wo he 


by about Pi after 
allotteq time; eg., 12:40) (10 minutes 


In ti 5 Id; 
he low-time Condition the subjects were t° 


to 

* One male subject signed up Ik 

now, so you could goin no up" 

_ Make sure he's show? — 7, 

ou Would be through by about “the 

efore the ull period for which e 
y) acted eg, 12:25 for 


A BALANCE THEORY INTERPRETATION OF DISSONANCE 


Assessment of Dependent Variables 


A retrieving the subject from the waiting 
: m, the experimenter set the stage for assessment 
of the dependent variables: 


uk is one more thing that I would like you to 
m he Psychology Department is taking a 
survey of students' reactions to psychology ex- 
periments. The Department has asked me to 
ie a if you would mind coming by the office 
b. ee a moment. I think they have some sort 
questionnaire they want you to fill out. 


adus the subject was greeted by a secretary 

Wire aem meure the questionnaire. Subjects 

the oficas y the experimenter as they came out of 
Th ice and fully debriefed. 

ei first page of the "Psychology Department 

^Uestionnaire" contained the following statement: 


Mes is ofa program to evaluate the effective- 
in peal i requirements for outside participation 
ene S tology courses, the Psychology Depart- 
E 2 i ag a survey of student reactions 

md p om sample of experimental s ons. 
help eros to the following questions will 
ally remi some idea of the way students typic- 
Questi A a iom Please answer cach 
Check to ved » giving your own opinion, and 
questionnaire Eos you complete the entire 
checked 2 ES ; Vhen you have finished and 
naire in Na your answers, deposit this question- 
“Psychol e T oni room in the box marked 
SE Dorset Department Questionnaires.” All 

hack are anonymous, so do not sign your name. 

ank you for your cooperation. 


O asse h : d 
The 5 Ss the manipulations and possible misgivings. 


orm 


reedom did These included, "How 
you have to choose whether or not you 

Siekes orm this additional function?" "How 
Unction >» ee, You in performing this additional 
Bivings ab To what extent did you have mis- 
about performing this function?” Answers 


1 
An E 
revealed oe of variance of this "cover item" 
Was found he same significant triple interaction that 
for the main dependent variable. 
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TABLE 2 
MEAN REPORTED ENJOYMENT 


High consequences | Low consequences 


Investment of time 


High Low High Low 

choice choice choice choice 

15 minutes 18.67 | 10.67 | 15.67 | 15.33 
1 minute 8.44 | 13.78 | 16.33 | 12.67 


_ Note. Control = 14 high time-anticipation — 15.25, low. 
time-anticipation = 1 Ratings were made on a 3l-point 
scale in which higher numbers indicate more enjoyment. 


were made on 7-point scales in which low numbers 
indicated the least amount of the attribute in ques- 
tion (freedom, success, misgivings). 


RESULTS 
Perceived Choice and Consequences 


A least squares analysis of variance of the 
perceived-choice ratings yielded only a 
main effect for choice, F(1, 59) = 102.94, 
p <.01. Ratings on the 7-point scale 
indicated greater choice or freedom in the 
high-choice condition (X = 6.75) than in 
the low-choice condition (X — 2.61). 

A least squares analysis of variance of 
success in carrying out the “additional 
function” resulted in a significant main 
effect for consequences, F(1, 59) = 41.02, 
p «.01. There was more reported success 
in the high-consequences condition 
(X = 5.99) than in the low-consequences 
condition (X = 3.98). This analysis re- 


sulted in one more significant effect, a 
main effect for choice, F (1, 59) = 9.63, 
High-choice subjects reported 


p <.01. 
more success (X — 5.48) than did low 


choice subjects (X = 4.50). The stooge's 
rating of the adequacy of the counterat- 
titudinal performance, however, did not 
agree with the subjects’ perceptions—the 
choice F being less than 1. 


Enjoyment of the Task 


Mean reported enjoyment of the task is 
contained in Table 2. A least squares 
analysis of variance resulted in only one 
significant effect, the triple interaction, 
F(1, 64) = 6.04, p < .017 (this triple inter- 
action was significant in a multivariate 
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TABLE 3 
MEAN REPORTED MISGIVINGS 


High consequences | Low consequences 


Investment of time 


High | Low | High | Low 
choice | choice | choice choice 

15 minutes 2.44 2.25 3.44 2.00 
1 minute | 4.29 2.38 242 2.56 


Note, Ratings were made on a 7-point scale in which high 
scores indicate greater misgivings. 


analysis of variance covering all of the 
assessments). Breakdown of this Choice 
X Consequences X Investment of Time 
interaction resulted in a significant Choice 
X Investment of Time interaction within 
high consequences, F(1, 32) = 8.76, p < .01, 
that was nonsignificantly reversed within 
low consequences, PD» $2) <1. The 
double interaction within high consequences 
resulted from a significant dissonance 
effect with high choice, t(64) = 3.02, 
P < .01, and a nonsignificant reinforcement 
effect with low choice it <1), 

The control mean, 14.12, diff 
cantly both from the high-dissonance mean, 
1(87) = 2.81, p <.05, and the low-dis- 
Sonance mean, /(87) = 2.4, p < .05, 

After completion of the experiment it was 
Suggested to us that perhaps a more ac- 
curate control procedure would involve 
telling subjects that they were, in fact, 
control subjects and that there was po- 
tentially more to the experiment than just 
working on thetask. In any event, we ran 
two additional control groups, one of which 
simply worked on the task (as above), and 
another of which was given the additional 
information. Each of these control groups 
contained 12 subjects. Mean enjoyment 
was identical (13.91) for the two groups. 

The means for the high-time-anticipation 
condition (15.25) and low-time-anticipation 
condition (15.75) do not differ significantly 
from each other or from control, 


ers signifi- 


Reported Misgivings 


Mean reported mis 
in Table 3. High numbers indicate gre 
misgivings. A least squares 
variance resulted only in 


givings are reported 
ater 
analysis of 
à significant 
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triple interaction, F(1, 59) = 4.34, p < 05. 
This triple interaction to a large extent 
results from the pattern within the dis- 
sonance cells (high choice, high conse- 
quences). Low dissonance resulted in the 
most reported misgivings and high dis- 
sonance in low reported misgivings. 

The Calder et al. (1973) study found that 
observers attributed high misgivings in the 
high-dissonance cell, while actors reported 
low misgivings in the high-dissonance and 
high-reinforcement cells. The present data 
are comparable, in that high dissonance 
(relative to low dissonance) resulted in 
low reported misgivings. These data, 
however, do not indicate a similar trend 
within the reinforcement cells— possibly 
because there was no reinforcement effect 
(unlike the Calder et al. study). 


Discussion 


The results of the experiment substanti- 
ate our major expectation of triple inter- 
action for rated enjoyableness of the task. 
With high consequences there was 
ficant double interaction. between 
and investment of ti 
significantly reversed with low conse- 
quences. The significant double interaction 
resulted from a significant dissonance effect 
with high choice anda nonsignificant rein- 
forcement effect with low choice. Both 
the high- and low-dissonance cells differed 
Significantly from control. 

The high-time-anticipation and low-time- 
anticipation conditions did not differ signifi- 
cantly from each other. Apparently manip- 
ulated anticipated time, together with 
manipulated anticipated consequences ar 

igh choice, are not sufficient to produce the 
dissonance effect, as suggested by the 
Wicklund et al. (1967) experiment. 
© are, of course, gratified that invest 
ment of time and reward both interact Wit? 


choice and consequences in the context ° 
the Festinger an 


digm. There are 
differences l 
the Calder e 


have a bearing upon the previously allu 
to affective value of high reward. 1” 
present experiment the control mea? 


a signi- 
choice 
me that was non- 


che 
is 
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intermediate to the high- and low-dis- 
Eos means, and differs significantly 
ee In the Calder et al. study, 
im 19 d ina tral mean differs only from 
sonance mez ees eee The low-dis- 
ited cn " is actually slightly above the 
| mean (see Table 1) This fact is 
potentially embarrassing to our previous 
E that the threat of responsibility, 
ie ples sg een there is low choice and 
high pie : flossémforeemeut cell) and 
Mn at high reward (low-dis- 
denial of i (ds followed by a forceful 
Yaliüaffontof piece and subsequent de- 
Erme m ea . Fortunately for that 
recent dn he low-dissonance cell of the 
why did - a is indeed below control. But 
cXperime " not occur in the Calder et al. 
the eur à A possible answer relates to 
the Ming orar of the high reward. An 
reward ma M righ consequences, the high 
ue to its e£ taken o miae salience 
menter cau S ication that the experi- 
happened. = Ke responsibility for what 
act the afore his in turn might counter- 
Sade oe ic eens tendency to down- 
these. ask as a result of responsibility 

7 and denial. 
Beste” that this argument is highly 
allow us E T he argument, however, does 
mean in e oon not only for the low 
the lack of ow-dissonance cell but also for 
reinforceme a reinforcement effect. The 
3) a effect in the. Calder et al. 
e E a is significant; in the present 
Téinfonceriene Furthermore, this lack of a 
act that "ques gag dert to the 
ment cell he mean in the high-reinforce- 
“alder et a not above control. In the 
trol Would . study an increase above con- 
affective MAE been facilitated by the 
Present zs a of the high reward. In the 

it Sum udy there was no reward. 

Count for ie we have attempted to ac: 
resent ex he high consequences data of the 
Geninen. aan and of the Calder et al. 
"irst, we Ding a three-part ap 
issonance ali argument for the high 
Ment, the E According to this argu- 
‘Nioyable Subject decides that the task is 
"Sbonsib: in order to be able to accept 
ibility for the counterattitudinal act, 


at 
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as the high choice and low reward indicate 
that she should. 

Second, we have an argument directly 
relating to the low-dissonance and low- 
reinforcement cells, and indirectly relating 
to the high-reinforcement cell. In the low- 
dissonance and low-reinforcement cells 
there is conflicting evidence regarding 
responsibility that leads first to a consider- 
ation of acceptance of responsibility and 
then a forceful denial of responsibility. 
This sequence increases the salience of the 
negative consequences and suggests, via 
attributional inference, that the avoided 
consequences are indeed negative. Thus 
the rating of the task is lowered in the 
low-reinforcement and low-dissonance cells. 
This argument is indirectly related to the 
high-reinforcement cell, where the lack of 
evidence for responsibility does not result 
in such a forceful denial of responsibility 
and thus a lowered evaluation of the task. 

Third and finally, we have an argument 
regarding the affective value of the high 
reward that was present in the Calder 
et al. (1973) experiment and is absent in 
the current experiment. The affective 
value of the high reward prevented the 
low dissonance cell of the Calder et al. 
study from going below control (as it did 
in the present study) and produced a 
higher rating in the high-reinforcement cell 
and thus a significant re 


inforcement effect 
(unlike the present study). 

Toward the beginning of this article 
the rather ambitious statement 
could account for all 
So far we have only 
ance situations, in 
, choice, and in- 
ompliance situ- 
a similar 


we made 
that balance theory 
dissonance effects. 

discussed forced-compli 
particular the role of reward 
vestment of time in forced-c 
ations. It should be apparent that 
analysis could be made concerning the role 
of punishment and effort in forced-compli- 
ance situations. The salient issue here 
revolves around the extent to which punish- 
ment and effort inherently involve negative 
consequences. Our guess is that punish- 
ment does involve negative consequences 
and that effort does not. But what about 
ations? A balance inter- 


free-choice situ 
choice situations 1s very 


pretation of free- 
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straightforward. Two cognitive bands are 
involved: the self (4-) selects or chooses 
(+) alternative A (?), and the self (+) 
rejects or does not choose (—) alternative 
iB (?). According to the multiplicative 
rule, alternative A should move in the 
positive direction and alternative B should 
move in the negative direction; that is, 
"spreading of the alternatives" should 
occur. It should also be noted th 
implicit choice could produce some s 
ing before the explicit choice, 
spreading will be most marked after the 
actual choice, when the linking cognition 
is strongest. Thus balance theory and 
dissonance theory agree in regarding the 
Spreading of the alternatives as a post- 
decisional, rather than a predecisional, 
effect. It is also true that this theoretical 
account, like dissonance theory, is poten- 
tially embarrassed by regret (Walster, 
1964). 

In general, we are impressed by balance 
theory's ability to account not only for 
dissonance effects but also for such diverse 
additional effects as attitudinal verbal 
reinforcement (Insko & Cialdini, 1971), 
affective conditioning (Insko & 
1972), interpersonal attractio. 
Cope, 1968), and overevaluation of in-group 
products (Ferguson & Kelley, 1964). De. 
Spite its lack of general Popularity or ac- 
ceptance, balance theory remains one of the 
most generally applicable theories in all of 
social Psychology. The main problem with 
the theory is that it has not been developed 
to specify which Cognitive band 
vant to which situations, 
in balance theory 
example, that a cogniti 
self-responsibility for 
is relevant to the F 
(1959) situation, 
and indeed of all 
then" format. If cei 
exist, then certai 
follow. The prob 

is that we are frequently very i 
regarding the “if” conditions, 

The above perspective on p, 

is exactly analogous to the 
cepted perspective on vario 


at an 
pread- 
but the 


Schopler, 
n (Aronson & 


negative Consequences 
estinger and Carlsmith 
Tests of balance theory, 
theories, follow an “if, 


alance theory 
generally ac. 
US hedonistic 
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theories (exchange theory, for example). 
Exchange theory is a general position w 
garding the importance of rewards and costs 
in the determination of behavior. The 
theory, however, does not say what indi- 
viduals regard as rewards and costs in par- 
ticular situations. Thus what appears to be 
an empirical test of exchange theory is in 
reality an empirical test of the assumption 
that certain rewards and costs are operating 
in a certain situation. Similarly, what ap- 
pears to be a test of balance theory is i. 
reality a test of the assumption that certain — | 
cognitive bands are operating in a given | 
situation. Thus we theorized that a band | 
Suggesting possible self-responsibility for | 
counterattitudinal advocacy and associated 
negative consequences is important in the 
Festinger and Carlsmith (1959) paradigm. 
his in turn led us to expect that, contrary 
to the Wicklund et al. results, investment 
of time should interact with choice and 
Consequences in a fashion roughly analogous 
to reward. This, in fact, happened. We 
would be the last to claim that such positive — 
results confirm a point of view. The last 
Word on what has historically proven to be 


both a puzzling and intriguing phenomenon 
has undoubtedly not been said. 
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Perturbation Model for Letter Identification 


George Wolford 


Dartmouth 


College 


i ^ itative "Ci t 
A model is presented which provides a reasonably good eas es Nene ae, 
of the data from a series of tachistoscopic report experiments in which retinz 


locus, spacing, and array position were systematically varied. 


Potential 


extensions of the model to other paradigms are also suggested. Ej dene 
assumptions of the model are that features are extracted from Vista oe 
in unlimited parallel channels as a function of retinal locus; that a | eee 
working serially, attempts to identify letters from the extracted fea 5; 


and that during 
features. 


The primary purpose of this article is to 
describe, justify, and elaborate a prelimin- 
ary quantitative model for a subset of the 
letter-identification process. The model 
was formulated as a computer simulation 
and was specifically designed to account for 
the data from a series of tachistoscopic 
full-report experiments, in which retinal 
location, array position, and spacing were 
systematically varied. The article begins 
with a consideration of some of the recent 
theoretical advances in the area 
description of the empirical base 
model. The model is then described 


with an account of its corresponde 
the empirical data, 
1$ suggested how the 


and a 
of the 
, along 


nce to 
In the final sections it 


model can be applied 
to other paradigms, and some comments on 
its relationship to other models are offered. 


Mopzrs or VISUAL PROCESSING 


A number of very 
formation-processing n 
in the last few years 
sented an 


This article was written while the 
visiting the Center for Human Information Proces- 
sing at the University of California, San Diego. 
The center (under partial Support from National 
Institute of Mental He; 15828) was 
generous in its assistance. Thanks are due William 
K. Estes, Samuel Hollingsworth, and David E 
Rumelhart for their advice and assistance, j 

Requests for reprints should be sent to George 
Wolford, Department of Psycholo 


gy, Dartmouth 
College, Hanover, New Hampshire 03755. 


- Estes (1972 
interactive-chan 


author was 


uth grant MH- 


processing, perturbations occur in the spatial ordering of the 


account for the data from a comprehen- 
sive detection experiment. The main 
assumptions of the model are (a) informa 
tion is extracted from visual displays i 
parallel at the feature level, (b) the density 
of detectors for each feature type decreases 
with increasing distance from the center of 
the fovea, and (c) decisions are made 0n 
the basis of the features extracted at the 
various retinal locations. 
quantitative aspects of the 
been fully specified, but 
tions can be deduced fr 
The first is that only 
interfere with one 
interference arises 


Many of the 
model have not 
à number of predic- 
om the assumptions 
confusable items wil 
another. 'Their mutua 


"e e 
over competition for m 
Same set of feature detectors. Items that 


are not confusable would presumably not 
share any common features and would not: 
therefore, compete for the same set Rr 
feature detectors, In a detection expe! 4 
ment, then, the number of noise element? 
in the display would not be a relevan 
variable unless the noise elements wa 
confusable with the targets. This d 
diction was borne out by Estes (1972). 1 

second prediction is that the amount 


n 1 A 
interference among confusable items eri 
crease as the items move into the P% e 
phery 


he 


d: 
i for 

rea (Estes & Wol wet 

a 


atthe 
Rosner, 1960; Mat? ha? 
The Iteractive-channels mode 


—4 
x 
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E. yet been applied to other paradigms, 
"4 as full report or partial report. 
kom ko (01273) and Shiffrin and 
independent ; vod ee versions of an 
Ensing dise para lel-channels model. The 
Eu cussion focuses primarily on the 
Glace V Dena by Shiffrin and Geisler, 
tive dum s ;* eloped in greater quantita- 
iplo: r M addition, their computer 
modal d s certain similarities to the 
inl = will be developed later. The 
(a) gees of their model follow: 
dependent p. robabi e pt kas some m 
(b) If inl Smid of being recognized. 
ola ced ree is not recognized, each 
Probability bi ied has some independent 
pt rus s of being detected. (c) A cor- 
lekte "in "is B made if the entire target 
Petal | ee or if one of the sets of 
the tan auc represents à subset of 
sions ia presented. Once again confu- 
in inia dun dh ZA primary source of error 
etesaarit n o s, since confusable letters 
Slonconhiesble noi some similar subsets. 
since the S ple noise letters have no effect, 
target. Wich fe lead to a subset of either 
beisen "à iie ew parameters, Shiffrin and 
account ee ed an excellent quantitative 
model has ne data in Estes (1972). Their 
Cision e. not yet been applied quantita- 
Thy to other paradigms. 
nnde modei under consideration was 
referred to by Rumelhart (1970) and is 
model. Th as a limited-capacity parallel 
as follows: puce of the model are 
display is | 2 Information from a visual 
decays Ris =) ge in some iconic form but 
is tasked 4 with time. (b) Information 
eaitiavas rom the store, in the form of 
of attenti a rate dependent on the amount 
tion, (c) p^ allocated to a particular loca- 
attention a ud dm have a finite amount of 
) The ls can control it$ allocation. 
Per item M of information required 
Varies with oo maximum performance 
More infor; the paradigm. For instance, 
respond co mation per item is required to 
detection mey in a report task than in a 
inent sou ask. Confusions are a prom- 
Items pring: of error, but nonconfusable 
compete 1 also lead to errors, since they 
e for the limited amount of avail- 
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able attention. Rumelhart's model pro- 
vides an accurate quantitative account of 
performance (overall per cent correct) in 
a number of different paradigms. 

While the differences among the models 
are testable experimentally (Estes, 1974; 
Gardner, 1973), they nevertheless share a 
number of primary assumptions. All three 
models assume that information is ex- 
tracted in parallel and that the extraction 
process works at the level of features. In 
addition, they all assume that confusions 
are a major source of error in detection 
tasks (this is in part a corollary of the 
assumption of feature extraction). The 
success of these models in accounting for 
the data from various detection paradigms 
he feasibility of the shared 
one of the models has been 
ke predictions about the 
rformance in report 
the shape of the 


argues well for t 
assumptions. N 
developed to ma 
specific nature of pe 
paradigms, for example, 
serial position curves, or the effects of such 
variables as retinal location, array posi- 
tion, and spacing. The model developed 
in this article is primarily concerned with 
accounting for the effects of these variables 
and borrows heavily from the preceding 
models. 

A number of theorists in the area have 
avoided report paradigms, partly because 
of their belief that those paradigms are 
higher order processes, 
such as acoustic short-term memory (Gard- 
ner, 1973; Shiffrin & Geisler, 1973). Their 
view is that short-term memory capacity 
places an upper bound on the number of 
letters that can be reported in the full- 
report procedure, and further that the 
upper bound is less than the number of 
letters actually "seen." Since their interest 
has primarily been in studying what is 
actually "seen," report tasks are inap- 
propriate. There is, 


however, little direct 
evidence that short-term memory is in- 
volved in report tasks. Wolford and 
Hollingsworth (19742) 


argued against 
short-term memory restrictions, based on 
two considerations. 


The first reason was 
that the performance 


asymptote in the 
standard full-report procedure is approxi- 
mately 4.0-4.5 letters 


contaminated by 


(Estes & Taylor, 
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Number of Letters Reported 


“15 “10 


Ficure 1. The mean number of letters 


placement of a display. 
equal 1.05°. 


substantially below 
short-term 
n about 7.0 
1970). The 
ct responses 


& Speelman, 
'as that incorre 


t ave argued that shor 
memory is normally acoustically 


have shown that acoustic conf, 
prominent, Particularly at short retention 
intervals, regardless of the mode of Presen- 
tation (Conrad, 1964 i Sperling & Speelman 
1970). , 
It is Possible, of course, that the per- 
formance asymptote in full-report pro- 
cedure is the result Pacity limitation 
in a short-term me; ith a Nonacoustic 
representation. The existence of 
short-term memory has been 
Estes (1973), among others, 
preceding arguments Were 
acoustic short-term memory, 
argument against short-term m 
pacity as a major influence in 


t-term 


usions are 


Specific to 
One final 
emory ca. 
full-report 


(Abscissa numbers refer to elite 
Abbreviations: Obs = obser 


GroncE WoLronp 


9----- Obs 
——s Pred 


“5 Oo 5 lo 
Retinal Locus 
reported as a function of the retinal 
type spaces, § 
ved; Pred = predicted.) 


Procedure is not Specific to 
type of 
memory, 


Wolford and Hollingsworth (1974c) pre- 
sented linear arrays of nine letters each at 
18 different retinal locations. Figure 
illustrates the mean number of letters m 
Ported per string asa function of the string $ 
retinal locus. The numbers on the abscissa 
refer to the retinal locus of the leftmost 
letter of a String. Each number represent? 
one typewriter Space (.20? of visual angle 
and negative numbers refer to positions ". 
the left Visual field, A string which begi” 
at — Symmetrica]ly spans the center ° 


veh "I" 
the fovea. As is illustrated, the number 


r 
letters reported peaks at about four lette 
for displays which 


? Span the center of t i. 

Ovea and €creases to about two lette j. 
Somewhat in the periphez 
memory capacity were d 

nce in full.report prose Ho?» 
Probably expect a flat func a 
ie Capacity limit would prov : 23 
ceiling effect. In other words, if the Pore 

i ally "seeing" many ™ 


ex” 
letters than were reported, one would 


any particular 
representation in short-term 
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pu the number of letters reported to 
Fuse tern memory capacity over 
d E of the retina considered 
stanly iudi jan the smooth decline which 
o di peii as the display is moved 
aa c nter of the fovea. 
ostii, i ig arguments presented 
E m eis y conclusive, I have chosen 
dinde t model the data from report 
(Goede oe reference to memory 
fae | S, . he same set of processes is 
En ,in dealing with all of the paradigms 
nsidered. 


THE EMPIRICAL BASE 


J, 
bue i the models presented above 
the mae quantitative predictions about 
found b i T serial position curves 
many dendi report paradigms and in 
reple ion ena, io A serial posi- 
asa stion T especies of performance 
a display re ZI position of an item in 
vary with number of factors typically 
retinal loi: TUS display pasen: the 
the one E of the item, its position in 
Number an s processing orders, and the 
"he vin Poeran of surrounding letters. 
identificatio T retinal location in letter 
iere is i jas been somewhat unclear. 
decreases y question that threshold acuity 
center of üt rapidly as one leaves the 
lalf of ees ovea, declining to less than 
Of the Cates TÉ acuity within a degree 
Teason ian (Alpern, 1962). There is no 
ites resented: to assume that the 
Seog A in typical tachistoscopic 
thickness S us near threshold size or 
Beste that al investigators have sug- 
ularly im retinal location is not a partic- 
; portant variable in letter identi- 


cati ain 
man ^ (Bryden, 1966; Crovitz & Schiff- 
» 1965). Other experimenters have 


Sug; 
est : 
S Sted that retinal location may not be 


cry im . à à 
become Portant by itself, but that it can 


middle string positions are 

by retinal locus. 

Vary vit, aoe above, several factors 

Vestigat 1 display position, and most in- 

Control Ors have exercised little or no 
Over these variables in examining 
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retinal location. For instance, most in- 
vestigators do not instruct their subjects 
as to the order in which to report the letters. 
If subjects report the letters in a mixture 
of orders in keeping with their biases, and 
if report order affects performance, it 
would be quite difficult to partial out the 
retinal effect from the various order effects. 
Several experiments have been carried out 
to examine the effects of retinal location 
with the other variables partialed out. One 
such experiment (Hollingsworth & Wol- 
ford, Note 1, Experiment 1) examined the 
effect of retinal location in a simple situa- 
tion, with report order eliminated as a 
variable by having the subject report only 
one letter. Single consonants were pre- 
sented at one of five retinal locations flanked 
by zero, one, or two noise letters (the 
letter U). Subjects were instructed to 
report only the consonant on each trial. 
The results of primary concern for this 
article are presented in Figure 2. There 
are several important points to notice in 
the data. The first is that even in the ab- 
sence of interference, letter identification 
performance did decrease with increasing 
distance from the center of the fovea. The 
decrease was not nearly as steep as the 
threshold function in Alpern (1962), but, 
as mentioned earlier, there is no reason to 
believe that the stimuli approached thresh- 
old size or thickness. The second im- 
portant point is that the mere presence of 
unreported adjacent letters decreased per- 
formance azid accentuated the retinal locus 
effect. 

In another experiment retinal location 
and string position were varied orthogonally 
(Wolford & Hollingsworth, 1974c, Experi- 
ment 1). Nine-letter linear arrays were 
presented at 18 different retinal locations. 
Subjects were instructed to report as many 
letters as possible and to process and report 
the letters in a left-to-right order. Their 
adherence to the instructions was moni- 
tored and feedback was presented every 15 
With this design it is possible to 
look at the effect of retinal locus with 
string position held constant. For in- 
stance, the leftmost letters of each string 
d the same position in the report 
and had the same 


trials. 


occupie 
and processing orders 
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FIGURE 2. Letter recognition 
levels of lateral masking (fro; 


number of Surrounding letters, Any varia- 
tion in performance among the leftmost 
letters, therefore, must be attributable to 
retinal location. In a similar manner the 
retinal function can be examined for 
of the nine string positions. 
functions are Presented in Figure 3. 
formance alon 
a function of retinal locus, 
among curves are ; 
tion. String Position in this 
a composite of the effe 
order, report order, and Spacing, 
apparent in Figure 3 i 


ng positions but that 
marked at the 


at the center of the fovea and decreased 
rapidly over t] 


he first 1,5° from the center, 
It is also clear that when retinal loc 


was held constant stri 
powerful variable. 
rapidly over the fi 


al positions and 
rose somewhat at the end i 


The two end positions (1 and 9) were 
Somewhat less affected by retinal locus 
than the other positions, One of the 


distinctive features of the ends is the pres- 


as a function of retin 
m Hollingsworth & Wolford, Note 


al position for three 
D. 


ence of a space on 
investigators have 
letters receive 


One side. Several 
demonstrated that 

interference from nearby 
adjacent letters (Estes & Wolford, 1971; 
Shaw, 1969. Townsend, Taylor, & Brown, 
1971). Space on one side of a letter 
therefore, Potentially frees that letter from 
Some amount of interference, Two proper- 
ties of that interference were of particula 
Importance in the development of the mode 
to be Presented in the next section. The 
first is that the amount of interference 
increases With increased distance into the 
Periphery, and the second is that the 
Interference appears to be asymmetric 
With. letters receiving more interference 
from letters 9n their peripheral side thay 


from letters on their foveal side (Wolford 1 
Hollingsworth, 1974)), 


In the Wolford and Hollingswort? 
(1974) experiment, Subjects were P! 
sented with nine-ltter linear arrays ag 
t five different retinal n 
cts were instructed to proc ht 

the letters in a left-to-rif 
order, and Spaces were introduced ter 
Some of the Strings either before or @ i 
the fifth letter in the instructed proce e 
order (pre- and Postspaces). The P ei 
formance On the fifth letters in that eXP 


Subje 
and report 
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PROBABILITY CORRECT 


189 


O 5 10 I5 


RETINAL LOCUS 


Ficung 3, 
denoted by s. 


ment . . 
isplays shown in Figure 4. Some of the 
the fifth Toe two As before or after 
Jects were letters instead of spaces. Sub- 
le pur instructed not to report those As. 
of the eios of the A displays was control 
fifth (Mi gps order advantage provided the 
Spaces. In by prespaces but not by post- 
Unmasking other words, in addition to any 
Space zm that a space might have, 

: l'OCessing ore a letter in the instructed 
In the ooo also moves that letter up 
Fa Shine ee order. The sensory effect 
Sach Space can be examined by comparing 
condi oe with its correspond- 
ler to th ion. The numbers on Figure 4 
Priate co € ¢ values between the appro- 
Was a xi mparisons. When the fifth letter 
Sith uide center of the fovea, a space on 
e was equally ineffective. When 


Ing 
Tefer 


ach of several string positions, 


Performance asa function of retinal location at e: 

nformation Pr (From “Retinal Location and String P Important Variables in Visual 

16, 437-442 rocessing" by G. Wolford and S. Hollingsworth, 
- Copyright 1975 by The Psychonomic Society, 


osition as 
Perception & Psychophysics, 1974, 


Inc. Reprinted by permission.) 


the fifth letter was in the periphery, spaces 
had a marked effect. As mentioned earlier, 
however, the space on the peripheral side 
of a letter was much more effective than a 
space on the fovealside. In the left visual 
field, then, a prespace was more effective, 
while in the right visual field, a postspace 
was more effective. The increased per- 
formance at String Positions 1 and 9 in 
Figure 3 and the flatness of the retinal 
function at those positions are probably 
due, in part, to such spacing effects. 


Even with the spacing’ effects held rela- 


tively constant, there wasistill a strong 
string-position effect in Figure 3. This 
effect is indicatedgby}the consistent drop 
in performance as one moves among the 
middle string-position curves. 'These posi- 
tions each had at least one letter on both 
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FIGURE 4. Probabilities of correct r $ inal mere 
. i reters 
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Ir and C refers i i à . (From “Lateral Masking in Visual Informa- 
tion Processing” by G. Wolford &S. ollingsworth, Perception & Psychophysics, 1974, 16, 315-320, 
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i ial 
P across these that determine the Shape of the Lee am 
to variation in Position curves. The shape of a particular 
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tually independent of report order. In the When no Instructions are given, English 
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: ined the center of the foren 
t those experiments Subjects might be expected to employ M 
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its Were ins the right visual field. Sub- 
il. dite to process and report 
us -right order. These curves 


trat 
a “te so T Eit 
b es deriba combinations of the vari- 
etinal oe (e.g., the interaction 
acing for - ation, processing order, and 
@ particular string). 


Tur p 
= PERTURBATION MODEL 


to © per NEN 
a "i wie ation model was formulated 
in Presented. lantitative account of the 
fre arious ass in the preceding section. 
Th ' from Tim. HMM the model borrows 
ection dl nele described earlier. 
Dtiong PS with a listing of the 
Der of the model, followed by a 
be, "E the explanatory comments con- 
9 Calle Pamptions. Finally, a num- 
the, involved in the actual simu- 

model will be discussed. 


Dun? 


acement 


— observed; Pred — predicted.) 


Extraction Assumptions 

1. Information is extracted from visual 
displays at the feature level via indepen- 
dent parallel channels and is then stored 
in a sensory information store. 

2. The probability of a feature being 
extracted is a function of retinal location, 
size, contrast, etc. 

3. The spaces between letters are ex- 
tracted just like other features, namely, as 
a function of their retinal location, width, 


etc. 


Processing and. Decision Assumptions 


4. The information in the sensory store 
is processed in serial in order to identify 
the letters. Subjects have considerable 


control over their processing order. — 
5. The first task of the processor is to 


parse the various features into groups. A 
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feature group is defined as all of the non- 
space features between two spaces. se 
6. The decision criteria used in vpn 
ing the letters depend upon the task. : or 
most of the report tasks considered here, 
it is assumed that if all of the features of a 
letter and no additional featur 
ent in a feature group, then that letter is 
output. Additionally, it is assumed that 
if the features in a group represent a proper 
subset of one and only one letter, then that 
letter is output. Otherwise, an omission 
error is made. 
7. The time it takes to process 
is a function of the amount of information 
required. Different experimental para- 
digms require different amounts of informa- 
tion per letter. For simplicity, it is as- 
sumed that each letter in a report task 
requires one Af of Processing time, 


es are pres- 


a letter 


Interference Assumptions 


8. All of the extracted features, 
ing spaces, are assumed to have some 
ordering. 


At of time, that is, 
relative spatial ordering of the fe 
over time. 

9. The Probability of 
a particular feature is 
retinal location of the 
amount of interletter spaci 
The wider the Spacing in 
smaller the Probability of 
between feature groups, 


a perturbation of 
a function of the 
feature and the 
ng in the display, 
the display the 
a perturbation 


to extract a Space. 


Third 
jects are assumed toh 


avec 
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. e 
most theorists have not "rapid i 
problem of parsing features into non k- 
by default have assumed that featur ai 
automatically placed in appropriate gr E. 
When spaces are not detected in pe E. 
turbation model, the features from ie m 
more letters are placed in the same € 
group and the identification process a that 
since it is unlikely (but not impossible A 
the combined features from two or eo A 
letters would equal some other letter 
subset of some other letter. M- 
The decision criteria described in cil 
sumption 5 for report tasks are fairly i 
Servative. For instance, if a partic oft 
feature group is a subset of two or e 
letters, an omission error will be ma 


: ould 
One could assume that subjects wW 
Suess from the a 
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an make an omission err c 
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Pom d d T features per letter ranges 
B. The Sb RS iM 10 for the letter 
ular experiment cle panni ati ag al 
Particular font clearly depends upon the 
The feature set NO in that experiment. 
Was chosen fo iie Lindsay and Norman 
emet t5 pr ses simulation because it 
ization of ete ide a reasonable character- 
esee. ss used in the experiments. 
ack of m of the feature set is the 
instance 2m i ational information. For 
the palatine ; ‘elt ia is included about 
iy ous the wo of the features 
ever: ; 
"Merle i-r mee were required for the 
which detenn the model. These functions, 
Perturbation une feature extraction and 
and are diffic 2s not directly observable 
considerations t F approximate from other 
all exponential he functions chosen were 
efend the Sh and no attempt is made to 
of functions Particular choice. Other sets 
Cuce distin, ORABUY exist that would pro- 
Cern in this results, but the primary con- 
Ol the article lies in the specification 
choice Processes involved and not in the 
Simulation A i functions. The actual 
“at there dm carried out to demonstrate 
and param ists at least one set of functions 
@Pproximation = that provide a reasonable 
Context of ai of the empirical data in the 
P Soll above assumptions. 
the iip, e ee are required for 
Probability oe of the model: (a) The 
Tom a cha eh feature being extracted 
Ovea ję TaCter a? from the center of the 


ae- GV, (1) 


elegant The cubic, while somewhat 
wide plate, was required to account for the 
Pr ves 4 the single letter data around 
Ted a festar ) The transition probability 
foy TOM the Owing from a feature group 

Vea is center to a group nearer the 


©) 7 aa eee, Q) 
Ne t Te 
fna. 9 tn 
ovi ee probability for a feature 
Stoup fa m a group x? from the center to 
Tther from the fovea is 


i es the Hr en. (3) 
Med 4 le perturbation process is as- 
e à random walk, the sum O 
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Equations 2 and 3 must be less than or 
equal to 1.0 for a given value of x. The 
following parameter values were used in 
all of the subsequent simulations: a — .995, 
b = 160, ki = .988, hı = 64, ks = .988, 
and A. = 16. The parameters a and b 
would be expected to vary with the physical 
properties of the display, and in particular 
with the size and thickness of the char- 
acters used. The As and ks would depend 
primarily upon the amount of spacing 
between the letters in a display. In the 
various simultations a, ki, and kə were 
quite sensitive. Small changes in their 
values had large effects on the simulated 
output. The parameter values used do 
not necessarily represent the optimal values. 
The simulation required about 45 minutes 
of computer time per 1,000 trials, so no 
attempt was made to exhaustively search 


the entire parameter space. 

A sample run through a trial of the 
simulation might prove useful in conveying 
the nature of the model. This sample trial 
ative of a simulation of the 
data depicted in Figure 3. The trial begins 
by choosing 1 of 18 retinal locations in 
which to present the display. Nine letters 
are then chosen at random from the con- 
sonants, without replacement, and the 
order of the letters is randomized. With 
some probability (specified by Equation 1 
above), each of the features from each of 
the presented letters (including the spaces 
between the letters) is stored in a sensory 
register. For instance, if B were one of the 
presented letters, each of its 10 features 
would have an independent probability 
of being extracted, depending upon the 
location of B on the retina. The parser 
then places all of the nonspace features into 
feature groups, with a group being all of 
the nonspace features between two spaces. 


In accordance with the experimental in- 
structions, the processor attempts to 
identify the letter represented by the left- 
most feature group, using the decision rule 
stated in Assumption 6. Following the 
processing of that feature group, each of 
the features in all of the groups has some 
probability (as specified by Equations 2 

an adjacent feature 


and 3) of moving to 
The processor then moves to the 


is represent 


group. 
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FIGURE 6. Simulated performance 


next group in the left-to-right order 
attempts to identify it, The simulation’s 
response on that trial would consist of 
those consonants for which a perfect match 

i feature groups or 
‘as found in one 
nsist pri- 
i although Some in. 


and 


of the groups. p 
marily of omissions 
trusions would be made, 
match does not guarantee a 
Sponse, since perturbations or Unextracted 
features might have transformed the fea. 
ture group, resulting from à presented letter 
into the Perfect match of another letter, 
ACCORDANCE WITH THE ] 


EMPIRICAL, Base 


The model was first applied to the data 
from Wolford and Hollingsworth (1974c, 
Experiment 1; see Figures 3 and 5). That 
experiment had 162 data Points (18 retina] 
locations crossed with 9 string positions), 
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O 5 lO I5 


Retinal Locus 


as a function of re 
tions, denoted by s, 


tinal locus at each of severa] string posi- 


The simulated values are 


00 
based on 1,0 
trials per point. 


t 
One indication of he fy 
of the model is illustrated in Figure 5. | che 
pen points and solid lines represent eri 
simulated values. The fit to that exp ói 
ment is further illustrated in mieria 
which corresponds to the empirical 9* 
Presented in Figure 3, While it is dime 
to illustrate the Correspondence iem $ 
two sets of 162 points simply, it um 
that the model captures most of the pet!” 
tional Nature of the data from that exl pot 
Ment quite well, In addition, there do ic? 
seem to be any Strong systematic differé ta 
pirical and simulated and 
analysis from Wolford’ j 
Hollingsworth’s (1974c) experiment t pe” 
relevant to the model concerns the " yeh 
Number of letters Output per p sof 
String as à function of the retinal l0 gi 


6 
th? 
of that String, The significance of 
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: TABLE 1 
SIMULATED PR ION 
ROPORTION CORRECT AS A FUNCTION OF RETINAL LOCUS AND SPACING 


Retinal locus 


=9 k = z 
oo six E D HERE M Pm E 
m m 17 21 18 18 19 20 40 
o 2 35 19 16 22 
87 24 19 17 23 43 P 
97 61 54 52 48 37 25 15 33 
97 66 62 35 26 21 32 
96 61 32 51 63 20 33 
98 71 46 27 17 12 08 1 17 
97 84 17 09 08 10 13 
98 76 45 23 47 19 18 


data w n 
empirical acd cussed earlier, and both the 
Figure 1. A simulated data are shown in 
Correspondence | again there is reasonable 
simulated v. ce between the observed and 
It ated values. 
tive Esc useful to provide an intuit- 
model provi - of why the perturbation 
empirical q T! reasonable fit to the 
Osition 1 data. The letters at String 
erence dene not subject to much inter- 
the proce ce they are at the beginning of 
le tmost pies order. Performance on the 
marily dod therefore, reflects pri- 
Sented as ne extraction function pre- 
Creases with QUAUOR 1g Performance de- 
anther our String position because letters 
Undergone pie the processing order have 
the Process more perturbation by the time 
Position is ogee them. When string 
Creases Sh. 5 constant, performance de- 
Decause th 1 distance into the periphery, 
I$ higher ee probability of a perturbation 
etters are n periphery, and because those 
their featy as likely to have had all of 
er tithanes | extracted in the first place. 
? because " ix elevated at String Position 
lons on d n letter is subject to perturba- 
The eee y one side. . 
9f the kis step in evaluating the viability 
introducin el was to examine the effects of 
Were ini into the strings. Spaces 
8tterg ide uced by removing the two 
tters ne the fifth letter or the two 
sequent to it. Processing was 
to take place in a left-to-right 
As was discussed in the empirical 
' Prespaces provide a report Or er 
age to the fifth letter that post- 


a 


€ctio 
n 
advant 


spaces do not. As an empirical control for 
this discrepancy, unreported As were intro- 
duced into some strings. It is not clear 
how best to simulate the effect of an un- 
reported A in the model. As a solution, 
the fifth letter on a prespace trial was 
the same position in the processing 
order as the fifth letter on a postspace trial. 
The simulated space effects are presented 


in Table 1. Except for the prespace data, 


these curves can be compared with roughly 
similar data in Estes and Wolford (1971) 
(1974b). 


or in Wolford and Hollingsworth 

The prespace curves should differ slightly 
because of the absence of the report order 
advantage in the simulation. 

The effects of spaces in the simulation. 
were similar to those in the empirical data. 
Spaces near the fovea were relatively in- 
effective, although they were more effective 
than in the empirical data. Spaces were 
more effective away from the center, and 
spaces on the peripheral side of a letter 
were superior to spaces on the foveal side. 
The intuitive reasons for the spaces effects 
in the simulation are straightforward. 
Perturbations, by design, are more fre- 
quent in the periphery and are more likely 
to occur in the direction of the fovea since 
hı was considerably larger than h; In 
addition, with the decision rule used, per- 
turbations to a feature group are more 
damaging than perturbations from a group. 


Losing features preserves the group as a 
hile adding features to 


assigned 


subset of the letter, W. 
a group often destroys the subset 
relationship. 
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TABLE 2 


VALUES FOR PROPORTION CORRECT AS A FUNCTION 
or SERIAL PosITION IN THE CONFUSION 


EXPERIMENT 
Serial position 
Values 

1 2 3 4 5 
Observed 94 90 76 54 67 
Simulated "» " 
Tm rule 99 91 70 50 57 
Old rule 97 76 45 24 36 


The simulation was also applied to the 
report paradigm described in Wolford and 
Hollingsworth (1974a). Subjects were 
presented with five-letter consonant arrays 
beginning at the center of the fovea and 
extending into the right visual field. Sub. 
jects were instructed to process in a left-to- 
right order and to always output five con- 
sonants on every trial. The simulated and 
observed serial position curves for that 
experiment are presented in Table 2. The 
letters did not have to be output in the 
correct order to be scored as correct. The 
same parameter values were used as in the 
previous simulations. The decision rule, 
however, was altered, since the subjects 
were forced to output a letter for every 
letter presented. The new rule was as 
follows: If the features in a group matched 
a letter perfectly, that letter was output; 
if the features in a group were a subset of 
one or more of the consonants, the letter 
with the highest Percentage of features in 
ers p A 25 A Du; 

ant was chosen from 

among all consonants with the same per- 
centage of features in common. Table 2 

lso contains i : 

the Pee a doe E values, with 
: Y 4 € used in the 
previous simulations. As jg evident in 
Table 2, the new decision rule provides a 


much better fit to the data than the con. 
servative rule. 


In order to examine th 
made in the Wolford and Hollingsworth 
(1974a) experiment, confusion Matrices 
were constructed from the errors for both 
the observed and simulated data, Those 
matrices were then clustered, using John. 
son’s (1967) nonmetric hierarchical cluster- 


€ types of errors 


GEORGE WOLFORD 


ing technique. All of the letters that 
clustered together above the fourth smallest 
diameter for the two clustering outputs are 
presented in Table 3. Letters that appear 
on the same row in Table 3 tend to appear 
as errors for each other in the data. Al- 
though there are many differences between 
the two outputs, there is a general similarity 
and a predominance of visual confusions 
in both cases. That similarity reflects pri- 


marily on the choice of the feature space 
and the decision rule, 


POTENTIAL EXTENSION OF THE MODEL 
TO OTHER PARADIGMS 


The perturbation model was specifically 
rmulated to provide a quantitative ac- 
count of the data from tachistoscopic 
experiments using the full-report procedure. 
The correspondence between the predicted 
and observed data, while by no means per- 
fect, is sufficient to warrant further atten- 
tion. There exist, however, numerous 
paradigms other than the report paradigms 
considered here. In fact, much of the recent 
literature has tended to focus on detection 
paradigms. Regardless of how well the 
perturbation model succeeds in fitting the 
data from report paradigms, it would be 


less than fruitful to invest further labor in 
it if the model were grossly inconsistent 
with the da 


i ta obtained from other para- 
digms. In this final section, therefore, an 
attempt is made to show that the model i$ 
Potentially consistent with data from other 
paradigms, 

Sperling (1959) developed a partial re- 
Port paradigm in which subjects were pre- 
sented with displays containing three row® 


of letters, A tone was sounded subseque? 
to the offset of each display, indicatinS 


foi 


TABLE 3 


IN 
ERING Resurts ror ERRORS 
THE CONFUSION EXPERIMENT 


Summary OF CLusr 


Simulated errors 
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p m the rows was to be reported. 
binder —— facilitated compared to a 
tone c une condition, but only if the 
Second. The the display by less than a 
similar n O e model predicts 
have control x Subjects are assumed to 
When. dis c over their processing order. 
report task c^ is sounded in the partial 
mediately b taen, the subject could im- 
row if he R processing the indicated 
tone was "€ "i already doing so. If the 
cated row te T soon enough, the indi- 
cessing order uld be moved up in the pro- 
ance would 9 the average and perform- 
Was too ning facilitated. If the delay 
completed a , processing would have been 
effect, nd the tone would have no 
Ir Rcs 
to Maus im task the subject is asked 
target lette lich of k previously specified 
taining n sa appeared in a display con- 
lon tasks is menis, Performance in detect- 
ance in ee superior to perform- 
Sured as in aes procedure, when mea- 
Perturbation es and Taylor (1964). The 
etection p godel also predicts superior 
More efficient des istow for two reasons: A 
the process nt decision rule can be used, and 
Preferred cure move more rapidly. The 
Used by urea: rule is similar to the rule 
Perfect’ mat, e and Geisler (1973). Ifa 
ature oro ch is encountered between a 
that m al and one of the targets, then 
Occurs is is output. If no perfect match 
the tar e à subset of one and only one of 
Bronson is found among the feature 
Other iique that target is output. In all 
he appro , subjects choose randomly from 
able to A targets. It seems reason- 
More senate te that the processor moves 
nly has ^ y in a detection task, since 1t 
With the t © compare each display element 
entire al arget letters rather than with the 
( prp as in a report task. Estes 
eerie d that latencies increased with 
More ra idl 1e size of the target set. The 
S sas. y the processor moves through 
tion; es the less the amount of perturba- 
the Sie, ce improved performance. For 
Seria] reason one would expect flatter 
an —— curves in detection tasks 
eport tasks. 


Mereag 
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As mentioned earlier, Estes (1972) found 
that detection performance did not vary 
with display size if nonconfusable noise 
were used. Two letters would 
presumably be confusable only if they 
shared common features. Performance did 
decrease with display size with confusable 
noise items. Performance in the perturba- 
tion model decreases with increases in the 
number of confusable noise elements, be- 
cause the target is moved farther out on 
the processing order on the average, and 
there is increased opportunity for one of 
the noise items to be confused with the 
incorrect target. Performance, however, 
does not decrease as nonconfusable items 
areadded. The target is still moved farther 
out on the processing order on the average, 
but since features move over time rather 
than decay, performance does not deterior- 
ate. If a subject finds one of the target 
features in a feature group, he knows that 
it came from one of the targets, since the 
noise letters by definition had no such 
features. Retinal location would be the 
only variable affecting performance in the 
nonconfusable case. If enough noise ele- 
ments were used so that the target was 
often moved far into the periphery, per- 
formance might deteriorate. The retinal 
extraction function, Equation 1, is quite 
flat, however, in the area covered by the 
displays in Estes (1972). 

Gardner (1973) designed an experiment 
to distinguish between his unlimited- 
capacity model and Rumelhart's (1970) 
limited-capacity model. Subjects were 
presented with two detection tasks on each 
trial, represented by the two rows of a 
display. Difficulty within a row was 
varied by the presence OF absence of a 
confusable noise element. Neither the 
noise nor target elements between rows 
were confusable. All elements were equi- 
distant from the center of the fovea and 
were separated by more than a degree of 
visual angle. Gardner's choice of spacing 
was based on the finding that letters separ- 
ated by a degree or more do not appear to 
interact (Collins & Eriksen, 1967). 
Gardner's main finding that performance 
in one of the tasks was not affected by the 
| of difficulty in the other task was 


letters 


leve 
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claimed to be inconsistent with po 
attention distributions in Ll — 
model. The perturbation = hee 
yield predictions similar to j r 
results. Perturbations between ys i 
groups are a function of distance m poul 
be assumed to approach zero with A pe 
of spacing. Processing order wou on 
matter then, and the difficulty in one tasi 
would have no bearing on the other. 
Difficulty within a task affects perform- 
ance, due to an increased probability of a 
confusion with the incorrect target. In 
essence, the perturbation model reduces to 
Gardner's independent parallel-channels 
model when retinal locus is held Constant 
and the spacing between letters is suffici- 
ently wide. , 
As mentioned before, processing order is 
not a relevant variable when spacing is 
sufficiently wide to prevent perturbations. 
For this reason the model is also consistent 
with the finding by Shiffrin and Gardner 
(1972) that sequential exposure was not 
superior to simultaneous exposure in their 
detection task. When spacing is wide, the 
perturbation model behaves very much 
like an unlimited-capacity model. When 
spacing is close, processing order serves a 


role similar to the distribution of a 
tion in Rumelhart's model. 


based in Rumelhart's mode 
upon both time and distan 
turbation model. 


There are findings which are inconsistent 
with the perturbation model as currently 
formulated. One such finding is that the 
percentage of transposition errors is lower 


in the model than in the real data, In 
Wolford and Hollingsworth (19742), ap- 
proximately one-third of the 


tel errors were 
transpositions. The Perturbation model, 


in simulating that experiment, made fewer 


transposition errors. Another troublesome 
finding comes from 


an experiment by 
Townsend et al. (1971). They presented 
subjects with linear arrays at long exposure 
durations (up to 3 seconds) while Prevent- 
ing eye movements. The serial position 
curves in their data were quite similar to 
the ones presented earlier, One would 
probably expect the feature extraction pro- 


cess in the perturbation model to continue 


tten- 
Decay is time- 
l but depends 
ce in the per- 
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: jield 
throughout the exposure duration and n: 
a fairly flat serial position HEU: du 
resolution of the model with these ae 
will have to await further research into ect 
effects of long exposure durations eG 
movements. There is also no e pe 
at present, for handling the effects o senal 
text, which can be quite powerful in Tosti 
information processing (Mewhort, 
Rumelhart & Siple, 1974). . ic he 
The last issue to be considered Bd we 
relationship of the perturbation mode h«c 
other existing models and to other poten e^ 
formulations. As mentioned earlier, poe 
of the quantitative models discussed Pris 
viously has been formulated to deal ec 
serial position effects in report uri i 
If one deals with detection paradigme in 
perturbation model would have much id 
common with all three models, but e s 
be particularly similar to the models € * 
veloped by Gardner (1973) and Shifiri 
and Geisler (1973). hie 
The primary difference between wn 
perturbation model and earlier quantitativ? 
models is the addition of a serial process 
and the specification of the perturbatio? 
Process. A serial processor has bee? 
postulated by a number of other investi 
gators and was considered necessary ' d 
account for the potent effects of report 
Processing order, Jt may be possible s 
formulate a parallel processor which wo! ot 
as well or better, but, at present, it is ge 
Clear to us how to do so. The perturP? 


» * aca 
tion process was chosen instead of a de 
Process so that 


> 0" 
x the model would poles. 
tentially consistent with many 2 Hees 
detection experiments (Collins & Erik 
1967; ( 


ri 
vardner, 1973; Shiffrin & Gard 
1972). As Was discussed previously: 4. 
chose to model the various report Pa 

digms without reference to capacity ua n 
in Short-term memory, Some interac 

with short-term memory may be red" os 
to account for the number of transP ut 
tion errors in Our report experiment? “to 
any very potent role would be diffictl! jg 


reconcile with the data presente 
Figure 1, 


The fact th 
qualitatively 
paradig 
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. . o 1 F 
consistent with many ay 1 
™S considered here in no Wa 


; jel 
at the perturbation m0 Í 


x 


Lx" 
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eei eid . it could provide adequate 
one e accounts of the data from 
used plo ^ pue Most of those paradigms 
that b. m k p and display shapes 
base Pic mec ifferent from the empirical 
se om c 2 this article, and different 
edie ca ant perhaps functions would be 
does provid — such fits, The model 
réport sea: fairly good account of the 
sterne EE ein considered here and 
level sib eim y consistent at a qualitative 
researdh is a ora paradigms. Further 
dirent Maso, 7 in order to obtain more 
postulated in = or some of the processes 
1e model. 
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Role of the Hypothalamus in the 
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Regulation of Food 
Intake 


Sebastian P, Grossman 
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A review of recent ex 
hypothalamic control of simple a 


and adipsia than is commonly 
tions of rats with such lesi 
ventromedial hypothalamic lesion s 
eating and obesity may be related, 


THE CONTEMPORARY MODEL or 
HYPOTHALAMIC REGULATION 

Little of significan 

the brain mechanisms 


(1940) demonstrated u 


animals return to a 
intense Overeating when st 
body weights (Brobeck, 1946), 

lateral hypothalamic lesions often 
Seemingly normal food and water 
when Properly nursed through wee 


normal 

ats with 

recover 

intake 
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perimental findings suggests th. 


ave been implicated, Th 


at the classic model of 


Ppetitive motivational states may be based, 
on possibly faulty interpretations of 


the effects of lesions in 


appears 
avior because 
not synapse on local neu- 
n syndrome, fiber Systems 
stem that constitutes the extra- 
his raises the 
ortant role į 


often months of aphagia and adipsia, but 
their Ingestive behayi 


be under Control of th 
which determine int 


the intact rat 
1964; Stricker x 
t is 8enerally accepted that 
: e which occurs in these 
animals is related mainly to the ingestion 
aboratory food (Kissileff & Epstein, 

ayed reactions to experi- 
YPovolemia and cellular 
been observed (see 
"he feeding behavior "s 
em to have recovered from 
ns in the lateral hypothalamus 


į R ch 
i. *SPOnSive to stimuli whi 
elicit food intake j 


e in the normal ani 
A do n eat in response to insu e 
pstein & eitelba r 
um, 1967) no 
the glucose ) 


analogue 


-deoxy-D-glucos? 
(Wayner, Cott 2-deoxy-D-g 


» Milner, & Tartaglion" 
Y Overreact to dietary e 
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Th . A 
intake derenvatious suggest that the food 
motivated = recovered animal may be 
1 -— 
regulated b hat ood intake may be 
thalamic m Fos pair of interacting hypo- 
Version "pem hanisms. 'The most detailed 
pióposes 4l such a model (Stellar, 1954) 
PrODDFfions the motivation to eat may be 
center” pts the activity of a "feeding 
that the “s 7 lateral hypothalamus and 
medial hy ced center" of the ventro- 
exerting P oe influences feeding by 
eral hypoth i itory effects on the lat- 
thought to We eris Both mechanisms are 
informatio collect neural as well as chemical 
the Orgànis about the nutritional state of 
only from sm and to receive inputs not 
Other ur EE sources but also from 
inputs and $ of the brain. Although the 
Centers w outputs of these hypothalamic 
hypothesis T, specified only vaguely, this 
and has i "ae enjoyed wide acceptance 
Cess in eae n heuristic suc- 
laboratories, ing the research of many 
h 

Bitte pm of this research have, in 
hypothal Onfirmed the hypothesis that the 
the Meso plays an important role in 
However ation of food and water intake. 
temporary she unique position which con- 
Indeed, the BEOKIES have accorded to it and, 
Fett have Pee ing of the influence it may 
“ndamental! ri questioned. The most 
he results P important issue raised by 
the possibilit recent investigations concerns 
9f lesions j : hes some or all of the effects 
to an ees b ic hypothalamus may be due 
Synapse in pe of fibers which may not 
Concern jg et area, A related area of 
alamic the „possibility that the hypo- 
Unique Ae eg may not perform 
Merely Bi alive functions but represent 
System of al points in a broadly distributed 
and dri a tne interacting, feeding- 
Many or ing-related pathways. Since 
Course th the diffuse fiber systems that 
Portant rough the hypothalamus are 1m- 
Otor components of the extrapyramidal 
System and related sensory path- 


Ways 
' the possibility must be considered 
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more seriously than it has been in the past 
that sensory or motor influences may play 
a major role in the disturbances in feeding 
or drinking that are observed in animals 
with hypothalamic lesions. 


A POSSIBLE ROLE OF EXTRAHYPOTHALAMIC 
MECHANISMS 


The uniqueness of the hypothalamic 
mechanisms has been questioned in a flurry 
of papers reporting that lesions in several 
extrahypothalamic structures produce ef- 
fects on food and water intake that are, in 
many cases, as severe, persistent, and ap- 
parently specific as those seen after hypo- 
thalamic damage. I reviewed this litera- 
ture in 1972 (Grossman, 1972) and shall 
not attempt to duplicate this effort here. 
A few examples and a review of some recent 
developments may, however, be useful. 

Complete and persistent aphagia and 
adipsia have been reported following lesions 
in the globus pallidus (Morgane, 1961a), 
midbrain tegmentum (Blatt & Lyon, 1968; 
Gold, 1967), and substantia nigra (Unger- 
stedt, 1971). Less persistent aphagia and 
adipsia have been seen after temporal lobe 
damage (Pubols, 1966) and after lesions in 
the lower brainstem or basal thalamus, 
which interrupt projections of the trigemi- 
nal nerve (Zeigler & Karten, 1974). 
Hyperphagia has been reported after lesions 
in the mammillary region (Graff & Stellar, 
1962), frontal lobe (Richter & Hawkes, 
1939), temporal lobe (Grossman & Gross- 
man, 1963), thalamus (Schreiner, Rioch, 
Pechtel, & Masserman, 1953), and peri- 
aqueductal gray of the brainstem (Skultety 
& Gary, 1962), following chemical or surg!- 
cal lesions in portions of the mesencephalon 
that are traversed by the ventral nor- 
adrenergic bundle (Ahlskog & Hoebel, 
1973) and after surgical transection of the 
anterior (Grossman & Grossman, 1970), 
lateral (Albert & Storlien, 1969; Sclafani & 
Grossman, 1969), and posterior (Grossman 
& Grossman, 1970) connections of the 
medial hypothalamus (discussed later). 
Hyperdipsia has been seen after some 
brainstem lesions in the vicinity of the 
interpeduncular nucleus (Kent & Gross- 
man, unpublished observation, 1974), after 
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surgical transection of the posterior fiber 
connections of the ventromedial hypo- 
thalamus (Grossman & Grossman, 1970), 
and after septal lesions (Harvey & Hunt, 
1965). 

There is some evidence that at least some 
of these structures may be arranged in 
series (to borrow an analogy from electronic 
circuits). For example, the mild and 
temporary effects of a unilateral lesion in 
the lateral hypothalamus are converted 
into severe and persisting aphagia and 
adipsia not only by a contralateral hypo- 
thalamic lesion but also by a unilateral 
lesion in the contralateral tegmentum 
(Gold, 1967) or globus pallidus (Krantz & 
Teitelbaum, cited in Teitelbaum, 1967). 
A similar interaction has been reported 
between the effects of lesions in the mid- 
brain and in the subthalamus and zona 
incerta (Blatt & Lyon, 1968). Additive 
effects of hyperphagia-producing asym- 
metric knife cuts and lesions have also been 
reported (e.g, Gold, Quackenbush, & 
Kapatos, 1972). 

An observation by Walsh and Grossman 
(1973) suggests that the effects of lateral 
hypothalamic lesions on water intake, as 
well as the impaired or absent feeding 
response to cellular glucoprivation, may in 
fact be the result of damage to tissues or 

fibers of passage of the zona incerta rather 

than to the hypothalamus itself, We found 
that small lesions in the anterior aspects of 
the zona incerta, which do not involve 
tissues of the lateral hypothalamus, com- 

1 pletely abolish regulatory drinking without 
affecting food intake, The effects of these 

. lesions may have escaped observation to 
date because animals with zona 
. lesions drink nearly 
|. Water when given ad- 
laboratory chow and 

.. when food deprived. 
= behavor of rats that have recovered from 
the effects of lateral hypothalamic lesions, 
but we have noted one in teresting exception 
(Walsh & Grossman, 1974). Whereas rats 
with lateral hypothalamic lesions do not 
drink in response to either intracellul 
dehydration or extracellul 
(although they may m 


incerta 
normal quantities of 


libitum access to dry 
display adipsia only 
This duplicates the 


ar 
ar hypovolemia 
ake long-term ad- 
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justments in intake, Stricker, 1973; Stricker 
& Wolf, 1967), rats with zona incer ta 
lesions show a selective impairment - 
their response to cellular dehydration an 
react normally to extracellular hypovole- 
mia. A similarly pure regulatory deficit 
has been reported by Blass and Epstein 
(1971) in the rat and by Peck and Novin 
(1971) in the rabbit after bilateral ap 
in the lateral preoptic region. In the bee 
some of these lesions selectively abolishe 
the response to cellular dehydration ; others 
inhibited drinking in response to extracellu- 
lar hypvolemia as well. 

Walsh and I have investigated the pei 
tionship between the zona incerta an 
adjacent tissues of the preoptic region, 
hypothalamus, and thalamus, which have 
all been implicated in food- or water-related 
regulatory functions. We found that knife 
Cuts which sever the direct connections 
between the zona incerta 
£ion reduced water int 
of food deprivation ( 


and preoptic re- 
ake during periods 


although not as dras- 
tically as zona incerta lesions) but had no 


effect on drinking in response to systemic 
injections of hypertonic saline or polyethy- 


lene glycol. Cuts beneath the zona incerta, 
which disrupt 


its connections with the 

lateral hypothalamus, not only interfered 
With the response to cellular dehydration 
but also inhibited the drinking response tO 
extracellular hypovolemia, which we found 
to be unaffected by zona incerta lesions- 
his suggests that the zona incerta as wel 
as the lateral Preoptic region may contain 
components which are important for ap- 
Propriate reactions to extracellular. anc 
cellular dehydration. The fact that out 
lesions in the anterior portions of the zon 


incerta (as well as some of Blass an 

. i 
Epstein’s lateral preoptic region lesions 
selectively affected 


ar 
y aff the response to cellula 
dehydration indicates further that in bo 


5 à e 
areas the two mechanisms may be, to a, 
extent, anatomically distinct. This is fuf" 


ther Supported by our observation vai 
Circular knife cut 


produced small 


effects on drinki 
cellular hy 
water inta 


. ox tra” 
Ng m response to CX! of 
Povolemia but had no effect 


ke after hypertonic saline. 
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e unte Duns gl peiie 
Teceptors hid ee pc hid sodium- 
State of cellulbr bod believed to monitor the 
is, e nl ration, may be located 
region, Wes ona: a and lateral preoptic 
that ceri fecti ipsc. U9) reported 
hypertonic suc Axis OF Hypertonie saline or 
lateral preo ucrose into some sites in the 
Sated rats e ides elicited drinking in 
distilled eed that similar injections. of 
in derived ; reduced water consumption 
(1971) noted animals. P Peck and Novin 
Walsh and I a effects in the rabbit. 
of hapeno — that microinjections 
elicited drinki Saline into the zona incerta 
hypertonic ing in sated rats but found 
stimulus ean S to be an ineffective 
observation » sh & Grossman, 1974). This 
Suggestion ih congruent with Andersson's 
tively nr central receptors, selec- 
to the re * x by sodium, may contribute 
inn tae of water intake (see 
materials). , for a review of related 
Nira ae tmd and Rolls (1970) re- 
renal diesen a oinjections of the powerful 
hypothalamus angiotensin into the anterior 
region prod S, preoptic area, and septal 
Suggesting E drinking in sated rats, 
Might also hat this region of the brain 
this com er ga the central effects of 
Dress) has iuc However, Johnson (in 
Pde ea shown that small doses 
cannula pl; in elicited drinking only at 
to the A. acements that involved injury 
mitted QU Emar system (and thus per- 
ventricles) a of the compound into the 
impson -- ie in conjunction with 
ation that outtenberg's (1973) obser- 
Organ, sli pee vn, of the subfornical 
System, e uem into the ventricular 
orig oae responses a 
i oh k giotensin, these findings sug- 
he Préontic subfornical organ rather than 
May be th c-anterior hypothalamic region 
tion, «4. as central site of angiotensin ac- 
Porte tis interpretation is further sup- 
Tecent bs Simpson and Epstein's (Note 1) 
“So abolish that subfornical organ lesions 
Nectig drinking responses to systemic 
ns of angiotensin. 
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There is, however, some evidence that 
the septal region may exert significant and 
specific influences on angiotensin-related 
thirst. Septal lesions result in persistent 
hyperdipsia (Harvey & Hunt, 1965), and 
Lubar, Boyce, and Schaefer (1968) have 
provided evidence that this effect of the 
lesion is not secondary to polyuria. Blass 
and Hanson (1970) have reported that rats 
with septal lesions responded normally to 
hypertonic saline challenges but drank 
more than controls when given subcutaneous 
polyethylene glycol injections. This sug- 
gested a selective facilitation of the response 
to extracellular hypovolemia, but a series of 
recent papers (Black & Mogenson, 1973; 
Blass, Nussbaum, & Hanson, 1974) have 
indicated that not all treatments that re- 
sult in extracellular hypovolemia are af- 
fected by septal lesions. Clear facilitatory 
effects were seen only when drinking was 
induced by renin or angiotensin or by 
treatments (such as caval ligation, hypo- 
tension due to isoproterenol, etc.) which 
affect the renin-angiotensin system. 

It is clear from this brief and necessarily 
selective overview of some recent develop- 
ments in this field that much remains to be 


about the functions of various 


learned 
es in the regula- 


extrahypothalamic structur 
tion of food and water intake. Itis already 
clear, however, thata number of subcortical 
regions outside the hypothalamus may 
exert many regulatory influences which 
were assigned to hypothalamic mechanisms 
only a few years ago. The emerging pat- 
tern suggests that we must take a closer 
and more critical look at the data that have 
led to the postulation of hypothalamic 


control. 
ARE FIBERS OF PASSAGE RESPONSIBLE 
Errects OF LATERAL 


FOR THE 
HYPOTHALAMIC LESIONS ? 


mental support for a 
of food and water 
es from studies that 
impairments after 


The principal experi 
hypothalamic model 
intake regulation com 


have shown severe 
lesions in the dorsolateral quadrant of the 


hypothalamus. This area is traversed by 
numerous diffuse fiber systems (e.g., com- 
ponents of the medial forebrain bundle, the 
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pallidofugal fiber system, nigrostriatal path- 
way, various corticofugal fiber Sys The 
and important sensory a na 
ON leons, Brus c of dies fiber 
Rope veis Mq: i ict with feed- 
E a tae f the destruction of 
ing, but not because o se rn 
cellular components of the esion site. : 
has also become increasingly clear that 
lateral hypothalamic lesions produce sen- 
sory and motor disturbances that may 
themselves contribute importantly to the 
impaired feeding and drinking behavior. 
'That the effects of hypothalamic lesions 
might be due to an interruption of fibers 
of passage rather than the destruction of 
cellular components of the area was first 
suggested by Morgane (19612), when he 
found that lesions in the globus pallidus, 
which interrupt the pallidofugal projections 
that pass through the lateral hypothalamus 
on their way to the lower brainstem, 
reproduced the effects of lateral h po- 
thalamic lesions. Gold (1967) supported 
this interpretation by showing that lesions 
in the brainstem tegmentum, which is one 
of the principal targets of the pallidofugal 
projection system, also produced aphagia 
and adipsia, and that a unilateral lesion in 
that area, when added to à contralateral 
lateral hypothalamic lesion, also produced 
persistent aphagia and adipsia. Teitelbaum 
and Krantz's (cited in Teitelbaum, 1967) 
observation of similar additive intera. 


between unilateral hypothalamic and pal- 
lidal lesions further Streng 


r an interruption of 
the pallidofugal fiber System, which has no 
known syn 


apses in the area of the feeding 
center. 


Ungerstedt (1971) has more r 
implicated the nigrostriatal pathway, which 
ascends through the lateral hypothalamus 
and adjacent portions of the internal 
capsule without known Synapse. He re. 
ported that electrolytic lesions of the 
substantia nigra, or microinjections of 6- 
hydroxydopamine (6-OHDA), Which selec- 
tively destroy the catecholaminergic com- 
ponents of the region, Produce persistent 
aphagia and adipsia (as well as akinesia 


ecently 
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> 'g hypo- 
and reduced alertness). U i mendi p 
thesis that the nigrostriatal pea Be 
be responsible for the effects cs water 
hypothalamic lesions on food = earlier 
intake is not incompatible with the wm 
observation that lesions | in the S psi 
pallidus result in aphagia and qe 
because major components of the ‘lobus 
striatal pathway course through the P date 
pallidus before terminating in the pe 
and putamen (Ungerstedt, 1971). Ham 
in the globus pallidus (Andén, d hy- 
berger, & Hókfelt, 1966) and the ee 
pothalamus (Glick, Greenstein, & de lete 
1974; Oltmans & Harvey, 1972) 2 
striatal dopamine, proof positive. tha vera 
pathway is, indeed, interrupted in se an 
of the lesions that produce aphagia 
adipsia. beet 
Ungerstedt's observations have the 
replicated and related not only to. si 
typically transient aphagia and adip in 
but also to persisting regulatory dehors. 
food and water intake. Fibiger, Zis, e 
McGeer (1973) have reported that 1nJ a 
tions of 6-OHDA into the substantia n$ 
Produce a transient (approximately in£ 
days) aphagia and adipsia, Follow i 
recovery of voluntary ingestive behave. 
the animals showed many of the persis ral 
deficits that characterize rats with ee 
hypothalamic lesions. Drinking occur ie, 
mainly in association with food int@® | 
finickiness was increased, the response 
experimentally induced intracellular 
ydration was absent, 
genic effects of 


and the anor fc 
amphetamine were pn A 
cantly reduced. Marshall and his psc 
(Marshall, Richardson, & Teitelb ve 
174; Marshall & Teitelbaum, 1973) and 
Observed more Persistent aphagl? pto 
adipsia after injection of 6-OHDA "rus 
the substantia nigra or into the E dual 
pallidus, These animals showed the gratel- 


Toit?! 
recovery of voluntary intake that 1€ 
baum and Epstein 


. (1962) observed in pey 
with lateral hypothalamic lesions. gel? 
subsequently displayed persisting , ation 
in their response to cellular glucopriv® a 
cellular and extracellular dehydrati onal! 
food deprivation, Which were esse? pe 
indistinguishable from those seen 


met 
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pe lesions of the lateral. hypo- 
amus (the 6-OH DA injections duced 
Somewhat less finickines S pone 
of these experi Be iness). The results 
Dornitanunr e de not, unfortunately, 
the role of th * i ed conclusion concerning 
fiber &ystem 1 qe pem pee nigrostriatal 
lation. bex in food and water intake regu- 
into the dan the injection of 6-OHDA 
Klobis Lb stantia nigra, as well as the 
tions cq produced marked deple- 
tine (as w cg and striatal norepineph- 
dopamine) p^ as sizable depletions of 
correlated weil ne duration of the aphagia 
depletion in tl with the extent of dopamine 
of the persi he striatum, but the severity 
Prisingly — regulatory deficits, sur- 
between thed not. (A similar correlation 
and the deg uration of aphagia and adipsia 
pletion Ec ge of striatal dopamine de- 
ateral hy wel natch dan lesions in the 
läns R eal has been reported by 
mayer, and EN Ce doe Smith, Stroh- 
Mections of ro (1972) have reported that 
ateral test -OHDA directly into the 
and adipsia et ae produced aphagia 
as peite although the effects were not 
lateral h ene Ba those often seen after 
ity and e api lesions. Their sever- 
with the Son appeared to be correlated 
reducing iii of the injections in 
rather thar typothalamic norepinephrine 
Tiesa an striatal dopamine. 
Horadrenen eS gations demonstrate that 
Blume sir components of the hypo- 
affected b à riatum, and telencephalon are 
ateral i 6-OHDA injections into the 
nali, P fe as well as into 
Nigra, th ng tissues such as the substantia 
This ig “ globus pallidus, or the ventricles. 
a potentially important problem, 
i . ae mechanisms have 
-chavior icated in the regulation of feeding 
erhaps p: other experimental paradigms. 
“Vidence ne most substantial body of 
ergie po uggesting that central noradren- 
Zatio Pathways participate in the organi- 
E is pate in t rg: 
om the ir ang: related behaviors comes 
Nul E that have shown 
js. injections of adrenergic com- 
nceph id related. drugs into the di- 
: eetivel,. elicit or inhibit [ood intake 
Y. The initial experiments (Gross- 


lat 
Boung 
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man, 1960, 1962a, 1962b) demonstrated 
that the application of norepinephrine to 
the perifornical region of the hypothalamus 
elicited eating in sated rats, and that 
injections of compounds that interfere with 
the action of norepinephrine blocked this 
effect and inhibited feeding in deprived 
animals. Subsequent investigations of this 
phenomenon (e.g., Booth, 1967; Coury, 
1967; Grossman, 1964a, 1964b; Leibowitz, 
1970) have shown that feeding can be 
elicited by injections of alpha adrenergic 
into various other parts of 
the hypothalamus, as well as into sites 
anterior and dorsal to it. The doses used 
in the initial investigations were quite 
high, so that the possibility of a nonspecific 
drug action could not be excluded. How- 
more recent studies (e.g, Ritter & 
2; Leibowitz, Note 3) have 
succeeded in reducing the dosage to more 
nearly physiological levels, and the con- 
clusion that feeding may be influenced by 
noradrenergic components of the upper 
brainstem appears well supported at this 
time. 
Reports that intraventricular injections 
of 6-hydroxydopa (which deplete brain 
norepinephrine but not dopamine) produce 
aphagia (Richardson & Jacobowitz, 1972) 
and that intraventricular injections of 
norepinephrine reverse the aphagia seen 
in rats after lateral hypothalamic lesions 
(Berger, Wise & Stein, 1971) are also in 
agreement with such an interpretation. 
The results of a number of recent investi- 
gations do, however, suggest that the 
destruction of dopaminergic pathways may 
be responsible for many of the disturbances 
in food and water intake that are seen 
after intracerebral injections of 6-OHDA. 
Ungerstedt (Note 4) has recently re- 
ported that the administration of dopamine 
feeding in rats that are 
aphagic as à result of 6-OHDA injections 
into the substantia nigra. Zigmond an 
Stricker (1972) reported earlier that intra- 
ventricular injections of this compoun 
depleted brain by 95% 


norepinephrine 
(but dopamine by only 62%) and produced 
only mild and trar 


xsient hypophagia. When 
animals were 


compounds 


ever, 
Epstein, Note 


agonists elicits 


the pretreated with the 
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i vydase inhibitor pargyline, 
CO ape un of norepinephrine was 
Bheers d, but dopamine was reduced to 
oe ples and more persistent aphagia 
o 
ipsia occurred. An . 
E extension of this rd 
tion, Stricker and Zigmond (1974) d 
served that rats given intraventricular 
injections of 6-OHDA plus a monoamine 
oxydase inhibitor progressed through the 
gradual and orderly recovery Stages de- 
scribed by Teitelbaum and Epstein (1962) 
for rats with lateral hypothal 
After recovery of voluntary 
animals failed to eat in respo; 
Blicoprivation and showed only the delayed 
drinking response to extracellular hypo- 
volemia that is typically seen in rats with 
lateral hypothalamic lesions. However, 
these animals responded promptly to 
hypertonic saline challenges and drank 
normal quantities of water during periods 
of food deprivation. This apparent Sparing 
of at least some of the mechanisms that 
regulate water intake is Surprising in view 
of recent reports of persisting deficits after 
6-OHDA injections? into the Substantia 
nigra or globus pallidus (Fibiger et als 
1973; Marshall et al., 1974), 


The effects of intraventricular 
of 6-0 


amic lesions, 
intake, these 
nse to cellular 


hypothalamic lesion syndrome may 
related to an in i 


fibers of Pass 
interesting questions and Suggest that this 


ire Story. One May ask, 
y voluntary food and Water 


even though the Opamine 
levels of the striatum remain Near zero 


levels in these experiments, or wh ! 
hypothalamic lesions, which typic: 
in much less complete depletions of striatal 
dopamine (Glick et al., 1974. 

Harvey, 1972), have 
persisting effects on food 
In this context, one 


be 


and water intake, 


also must not lose sight 
of the fact that aphagia and adipsia have 


been reported after lesions in portions of 
the brainstem that are not traversed by the 
dopaminergic nigrostriatal System (e.g, 
Lyon, Halpern, & Mintz, 1968; Parker & 
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Feldman, 1967; Zeigler & Karten, D 
Although it is possible that ig Sit 
may interfere with afferents to ae 
stantia nigra, it is unlikely that they arcing 
result in the massive striatal ete to 
depletions that are apparently. pue an 
produce persistent changes m v brain- 
water intake. The effects of these al, or 
stem lesions on hypothalamic, w-— 
telencephalic norepinephrine are, un 
nately, not known at this time. at the 
These considerations suggest iss s on 
effects of lateral hypothalamic uut 
food and water intake may not be sii 
determined by the destruction of ae ; 
regulatory mechanism but may, bance 
reflect a complex syndrome of distur ee 
in several functions, which may be me 
by several distinct fiber systems. "m 
We have begun to investigate the nnect 
tions of the fiber systems that interco some 
the brainstem and basal ganglia 1.1973) 
detail (Grossman & Grossman, 1971, of à 
We find that surgical transections lamus 
fibers that enter or leave the ipee nitro” 
laterally (including pallidofugal v; vonnet 
striatal Pathways and the REC ptoee : apo" 
tions between the brainstem and eater’ 
lobe) Teproduce the effects of sl 
hypothalamic lesions in every respe yg ten” 
these experiments, a retractable Sune used 
Wire knife of 125-um diameter WI plane 
to make à cut in the parasagitta terna 
slong the medial border of the za mm 
Capsule, In the rat, a cut that is < 
in length 


the bottom of 
Most of the | 
hypothal one” g 
cant direct damage to cellular par^ fin 
of the amus or striatum. V hag" 
th produces long-term a E 
: à which in many instances ite E. 

hin 90-120 days, in SP' o : 


a 


" C z 
Animals that re erg 


» 1n9 7 ol 
Y food anq water intake Ute 
adual reco 


e 
very described by vith 
baum and Epstein (1962) for wc uf 
lateral hypothalamic lesions. In rat 
Stage of this recovery process, or UN 
consume ade amounts of a pal. 


the gr. 
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B oe diet and are, indeed, hyperphagic 
D ess Lt liquid diet is offered. 
of hy vane promptly return to a state 
Duell d or aphagia when a less 
Bie er ry, powdered diet (which normal 
S eat readily) is presented or when 
quinine is added to a palatable liquid di 
his pattern of i Palate liqui iet. 
tliat these ec ingestive behavior suggests 
konsilia ng quw animals are extremely 
diet, and zd the sensory qualities of their 
Ree ee is reason to believe that 
perhaps s 7 may be the principal and 
in these s le determinant of food ingestion 
The | animals. 
Meane Le of central glucoreceptor 
by the ol ae recently been questioned 
hypothal OK VALLON that large anterior 
us ndi amie lesions (Blass & Kraly, 1974), 
(Walsh apis DAVE to the zona incerta 
tion, 1974) rossman, unpublished observa- 
(Alheid & Ew ventromedial mesencephalon 
tion, 1974) s pesman; unpublished observa- 
cellular jJ a bolish the feeding response to 
in oie Be without resulting 
ood hek disturbances in ad-libitum 
Delieveg that » is nonetheless commonly 
Its utilizatic 5 ood sugar or, more precisely, 
be an im On Dy central glucoreceptors may 
ood pees tant signal for the regulation of 
Part on = This hypothesis is based in 
injections observation that systemic 
Sugar Had insulin, which lower blood 
Utilization , presumably its intracellular 
alloway ; a Ch food intake (Mackay, 
Systemic P , Barnes, 1940), and that 
analogue grim. ui ue of the glucose 
Petes wig -deoxy-D-glucose, which com- 
tabolic 1 glucose for intracellular me- 
Smith pathways, produces similar effects 
Shown tl Epstein, 1969). It has been 
€Siong hat rats with lateral hypothalamic 
(Epstein. bii: respond to these treatments 
on 1971), yee baum, 1967; Wayner et 
Ccovered p. similarly find that rats that 
of , from the effects of transections 
lateral connections of the hypo- 
$ did not eat in response to insulin 
Man, 19 Y-D-glucose (Grossman & Gross- 
fai ion Indeed, our animals also 
in te food intake reliably in 
* to changes in environmental tem- 


Der. 
ature 
res—the only regulatory response 
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which appears to be intact after lateral 
hypothalamic lesions (Epstein & Teitel- 
baum, 1967)—or injection of 6-OHDA into 
the substantia nigra or globus pallidus 
(Marshall et al., 1974; Marshall & Teitel- 
baum, 1973). 

An examination of the water intake of 
our rats following apparent recovery of 
voluntary intake suggests that they do not 
regain any regulatory capabilities following 
surgery but drink water only to facilitate 
the ingestion of dry food. Water intake 
falls to zero during periods of food depriva- 
tion, and there is no response to treatments 
that elicit or increase drinking in the 
normal rat. 

Drinking occurs mainly in response to 
the development of intracellular dehydra- 
tion or to a decrease in the volume of 
extracellular fluid. The operation of the 
first mechanism is readily demonstrated in 
the intact rat by the subcutaneous or 
intraperitoneal administration of hyper- 
tonic saline, which results in cellular 
dehydration and elicits water intake (e.g. 
Corbit, 1965). Extracellular hypovolemia 
can be experimentally produced by sub- 
cutaneous or intraperitoneal injections of 
hyperoncotic compounds, such as poly- 
ethylene glycol, which sequester extra- 
cellular fluids and result in intense thirst, 
which is not slaked by water (Stricker, 
1968). The two mechanisms become effec- 
tive at different times during ontogeny 
(Friedman & Campbell, 1974) and are 
believed to be independent. Drinking 1s 
also induced by the peripheral or central 
administration of renin, or its metabolic 
product angiotensin, and by a variety of 
pharmacological or physiological treat- 
ments that induce the release of this 
hormone from the kidney. There is also in- 
creasing evidence that hypotensive agents 
such as isoproterenol elicit thirst by acting 
on still a fourth and possibly independent 
mechanism (see recent reviews by Epstein, 
1973; Fisher, 1973; and Lehr, 1973). 

Rats with lateral hypothalamic lesions 
do not drink in response to extracellular 
volemia or intracellular dehydration, 
adjustments in intake 
see Stricker, 1973, 


hypo 
although long-term 


have been reported ( 
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for review), and a similar loss of regulatory 
control was seen in our rats that had 
recovered from the effects of knife cuts 
along the border of the lateral hypo- 
thalamus (Grossman & Grossman, 1973). 
Our observations indicate that an inter- 
ruption of the fibers that enter or leave the 
lateral border of the hypothalamus repro- 
duce, in detail, the effects of lateral hypo- 
thalamic lesions on food and water intake. 
This lends support to the hypothesis that 
possibly all of the effects of lateral hypo- 
thalamic damage on ingestive behavior may 
be due to an interruption of fiber systems 
that may or may not synapse in the area 
identified as the lateral hypothalamic ‘‘feed- 
ing region.” Since this area is traversed 
by several major fiber systems that inter- 
digitate extensively (see Dahlström & 
Fuxe, 1964; Guillery, 1957; Knook, 1965; 
Millhouse, 1969, 1973a, 1973b; Nauta, 
1957, 1958; Raisman, 1966; Ungerstedt, 
1971; Wolf & Sutin, 1966), it has been 
difficult to ascertain which of them might 
be responsible for the observed effects on 
food and water intake. We have made 
some progress in this respect, but much 
remains to be learned. 
Morgane (1961b, 1961c) 
number of years ago that 
interrupted the medial foreb 
anterior or posterior to the region of the 
lateral hypothalamic feeding center did 
not produce lasting aphagia or adipsia. 
Similar results have been reported by 
DiCara and Wolf (1968). We have found 
more recently that small knife cuts in the 
coronal plane, just rostral to the substantia 
nigra, produced long-term (>30 days) 
aphagia and adipsia (Kent, Rezak, & 
Grossman, 1973). A similar cut only .5 
mm rostral to this plane and even quite 
large cuts across the medial forebrain 
bundle, which severed most or all of its 
lateral hypothalamic components just be- 
hind the feeding center, had only tem- 
porary (1-6 days) inhibitory effects on 
food and water intake (McDermott & 
Grossman, unpublished observation, 1973). 
Paxinos and Bindra (1973) also reported 
normal food intake followi ng similar coronal 


cuts across the medial forebrain bundle. 


reported a 
lesions that 
rain bundle 
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Albert, Storlien, Wood, and Ehman (1970) 
have reported aphagia and adipsia s) 
very large coronal knife cuts anterior ah 
posterior to the lateral hypothalamic foci 
ing center, but it appears likely that he 
results were due to an involvement O 
fibers that travel in the internal capsule. 
The overall pattern of results seen after 
knife cuts across the medial forebrain 
bundle suggests that fibers that are relate ; 
to ingestive behavior may not ascend e 
far in the lateral hypothalamus itself, bu 
turn laterally and perhaps dorsally in the 
posterior diencephalon. Such an interpre 
tation is supported by the results of a" 
earlier investigation (Blatt & Lyon, 1968), 
which demonstrate additive effects on foo 
and water intake of asymmetric lesions n 
the midbrain and in lateral portions of the 
subthalamic region and parts of the zona 
incerta. 

We have found that par 
in the striatum and intern 
run parallel to those th 
aphagia and adipsia, do not produce these 
effects (Grossman & Grossman, 1971; 
Grossman & Grossman, unpublished obse" 
vation, 1973). Alheid and | (Alheid. & 
Grossman, unpublished observation, 197 ) 
have further observed that short para" 
Sagittal cuts along the lateral border of the 
Preoptic area and anterior hypothalamv? 
do not produce persisting aphagia td 
adipsia, even though they are in the sam 
Parasagittal plane as more posterior cu? 
Which are consistently effective. Negat!Y 
results were also observed in rats Pd 
lesions in each of the major nuclei of a 
amygdala and in the piriform corte 
which, in the rat, is probably the pe 
Source of the amygdalofugal paths; 
Grossman, Grossman, & Walsh, 19% 

emme, 1972). These results suse", 
that reciprocal Connections between ,. 


i 

t g 
brainstem and temporal lobe, inclu ai? 
ateral projections of the medial forebt pd 


bundle, the amygdalofugal pathway: ipl 
ansa peduncul 


Not responsi]; 
along the 
thalamus, 

re 


asagittal cuta 
al capsule, whic 
at result in long-ter™ 


t 


aris of Meynert, are prove its 
le for the effects of our © 0" 
lateral border of the d 
Paxinos (1974) has rece ofl 
ported results which support uis 
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E It is possible that the critical 
Bo a may turn rostrally immediately 
project nering the internal capsule and 
TA tae from cortical sites (Webster, 
Bi de d However, it appears likely, 
(19610 Sora of the work of Morgane 
stedt (197 1b, 1961c), Gold (1967), Unger- 
others” d 1) Marshall et al. (1974), and 
and E. hat components of the complex 
A necte the a system that inter- 
h A rainstem and striatum may 


Do S 
SENSORY AN 
Pray SORY AND MoroR DysFUNCTIONS 
i righ Important PART IN THE 
-ATERAL HYPOTHALAMIC 
SYNDROME? 


The palt; 

dang pn Tin and nigrostriatal projec- 
System inia important role in the feedback 
and bns s empires posture, locomotion, 
or review) = haa ala (see Laursen, 1963, 
Well as tho he effects of our knife cuts, as 
Pallidal, he ee ut lateral hypothalamic, 
ore, reflect oe lesions, might, there- 
rather than OKO or sensory disturbances 

an an interference with mecha- 


nisms ge 

thirst My related to hunger or 
lesiong Ed ats with lateral hypothalamic 
but are not only aphagic and adipsic 


also ; 

though ua extremely hypoactive—al- 

activity hey display starvation hyper- 
Y after deprivation 

They often 


play s ; 
Bite e similar to those seen in 
lon| patients (e.g., they remain mo- 


OsScin; (Balagura, Wilcox, & 
jun, o 09; Marshall, Turner, & Teitel- 
um, 1965 ; Rodgers, Epstein, 
M any of 5; Teitelbaum & Epstein, 

of these symptoms disappear OF 


Imin; 
“Minish wh; 
while 7 x 
an h while the animals are still aphagic 
idence 
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which Marshall and his associates (Mar- 
shall et al., 1971; Marshall & Teitelbaum, 
1974) have seen in association with aphagia 
and adipsia. Rodgers et al. (1965) have 
shown that rats with lateral hypothalamic 
lesions not only fail to feed and drink but 
also do not press a lever for intragastrically 
delivered food rewards. This observation 
supports a motivational interpretation, but 
it is not as conclusive as one might wish, 
because the animals’ ability to lever-press 
for other reinforcements (such as brain- 
stimulation rewards or shock avoidance) 
was not demonstrated. That this is an 
essential control is suggested by several 
reports of impaired brain-stimulation-re- 
warded or food-rewarded lever pressing in 
animals with lesions just rostral or caudal 
to the feeding center (Chase & Moore, 
1968; DiCara & Wolf, 1968; Morgane, 
1961b, 1961c). Severe impairments of 
escape and avoidance behaviors (Coscina 
& Balagura, 1970; Runnels & Thompson, 
1969) as well as food-rewarded or brain- 
stimulation-rewarded lever pressing (Mor- 
gane, 1961b, 1961c) have been reported 
after lesions in the lateral hypothalamic 
feeding center itself. The problem is 
emphasized by our own observation (Kent 
& Grossman, 1973; Kent et al, 1973; 
Alheid & Grossman, unpublished observa- 
tion, 1973) that many of the knife cuts 
which duplicate the effects of lateral hypo- 
thalamic lesions on ingestive behavior also 
abolish lever pressing (as well as other 
instrumental behaviors), irrespective of 
reward conditions. 
Investigators who hav! 
and adipsia by electrolytic or chemical 
lesions outside the hypothalamus have 
frequently commented on the severity of 
the accompanying motor disturbances an 
have often suggested that the disturbances 
in feeding behavior might not reflect moti- 
vational impairments. Morgane (1961b), 


for instance, suggested that lesions in the 
basic 


e produced aphagia 


globus pallidus might result in 

metabolic dysfunctions; Blatt and Lyon 

(1968) proposed that the aphagia and 
he midbrain 


fter damage to t 
ht be due to an interruption 


"normally aid in the 


adipsia seen a 
tegmentum mig 
of pathways which 
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iati ensory cues and motor 
NO SUM Nie finding, seizing, 
pe eme into the mouth. of food 
objects” (p. 576); Ungerstedt (1971) noted 
N V k and lack of alertness 
> pate after injections a iir ides 
ia nigra, and discusse 2 E 
icem this might be the cause ihe a 
served aphagia and adipsia ; and Schwartz- 
baum and his associates (Levine, Ferguson, 
Kreinick, Gustafson, & Schwartz pue 
1971; Levine & Schwartzbaum, 1973) ave 
made similar comments concerning the 
effects of striatal lesions on ingestive be- 
havior, following a thorough Study of the 
sensorimotor capabilities of rats with uni- 
lateral or bilateral striatal damage. Zeigler 
and Karten (1974) recently reported 
aphagia and adipsia after lesions in the 
brainstem and basal thalamus and s 
gested that these effects 
related to an interruption 
carried by the trigemina 
region of the mouth. 


Our own Observations of Tats that are 
aphagic and adipsic as a result of knife 
cuts lateral to or behind the feeding center 
do not, as yet, permit safe conclusions 
concerning the possible role of sens y or 
motor disturbances 


- We typically observe 
motor dysfunctions that are Much less 


pronounced and persistent than those 
produced by most p, alamic 
lesions, Hypoactivity nd catatoni 


ug- 
might be directly 
of Sensory signals 
l nerve from the 


a-re- 


Owever, aft 


1 : Tecovery of 
ary ingestive behavi 


V avior, our animals 
have apparent difficulties with gnawing, 
biting, chewing, and perhaps sw 


animal's cage 
To what extent these 
May contribute to the 
syndrome is, at the 


recessed, 
motor dysfunctions 
aphagia and 


adipsia 
moment, not cle 
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It is often argued (e.g., Epstein, D 
that the persisting regulatory defici eiu. 
are seen in rats with lateral hypoti "n ‘cuts 
lesions (as well as in our rats on in 
lateral to the lateral hypothalamus ) after 
rats with nigral or pallidal damage . not 
recovery of ‘voluntary intake, Tecate | 
amenable to such an iiterprelation, bo le of 
the animals are demonstrably SA i5 
the act of feeding and lii, ae 
however, possible that the presen may 
residual motor or sensory a an 
make ingestive behavior more ewe 
may thus increase the threshold r deoxy. 
or drinking in response to insulin, i cellular 
D-glucose, and extracellular or intrac verfu 
dehydration, which may not be as jene to 
as is Senerally assumed (the. eticii 
insulin and 2-deoxy-D-glucose, in par trols): 
is often small and variable even in con imos 
"he fact that recovered animals adu in 
always drink normal quantities of bi when | 
the Presence of food but are adipsic i 


‘psia 

the adips! 

food eprived suggests that. the Pn mo- 

May be related to a dysfunction o tri- 
tivational 


; son 
Nature, but a possible e 
Sensorimotor Ripa d 
of the recovery cycle € this 
uled out. Clarification a an 
question and of the role of ir eal 
motor dysfunctions in the igprioniy the 
sient aphagia is. Without doubt, one 

most important tasks in this field. 


bution of 


Some Stage 
cannot be p 
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, hypo?" 
Although hyperphagia-inducing : * the 
thalamic lesions typically involv 
ventromedial nucleus, it has long 48 
‘Nown that the most effective lesio! 
ateral 

itself 
1962 
th 


pee” 


the 
oe 
and slightly ventral © b 
Brobeck, 1946; pon d 

etherington & stricte, 
at lesions that are res ype 
€ itself do not produce "zi 
Phagia (Joseph & Knigge, 1968). . 
the effect; 


an? 0 
etherington & oe é {0 
S Suggested many ye ue 
Petphagia might be 
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ped d projections to or from the 
8) hes SU DB, T046), Gald 
of lesions ecently re-examined the effects 
thalamus wu me ventrem hypo- 
implicating tl - et of new evidence 
way in feo si ventral noradrenergic path- 
and oat da satiety (discussed later), 
only lesion ec E his investigation that 
ventrolateral which interrupt this bundle 
result in im T. the ventromedial nucleus 

Tie GEN a hyperphagia syndrome. 
rom edl siron noradrenergic bundle arises 
ascends Gone the medulla and pons and 
ts axons in ug 1 the ventral tegmentum. 
iencephalo nervate medial portions of the 
Projections - preferentially, although longer 
een obse bus: limbic structures have 
uu pee. Q0 159 Pure & 

e context ). It is of some interest in 
ew noradre o RU discussion that only a 
of the vent energic axons terminate on cells 
the hen DU nucleus itself and that 
Ventricular e "m profusely to the para- 
Appears ( ucleus (Fuxe, 1965), which 

ici O be a focal point for feeding 


also been implicated by Ahlskog and 
after diii report of hyperphagia 
Ventral teg ytic or chemical lesions in the 
Y this a tol megion that is traversed 
(às well a le, which depleted diencephalic 
"hat the serui norepinephrine. 
amic neta ects of ventromedial hypo- 
ght ma m on food intake and body 
= ventral be due to an interruption of 
" à! noradrenergic bundle is also 
Ua our observation (Grossman, 
Caudal iG 1 nife cuts in the coronal plane 
Spare the the ventromedial nucleus (which 
tissues that are most commonly 
interru, byperphagia-inducing lesions, 
: medial | the posterior connections of 
ta Se lypothalamus, including the 
| ced ml. es bundle) also pro- 

Unpubli ed hyperphagia. Hennessy an 
ently ae observation, 1974) have 
edet tudied the effects of these cuts 1n 
üg in ail, because Sclafani (1971), work- 
Signife my laboratory, did not observe 
cant changes in food intake or body 


thal 
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weight following knife cuts immediately 
behind the tip of the ventromedial nucleus. 
Our investigation disclosed that only cuts 
which transected fibers coursing ventral 
and lateral to the region behind the ventro- 
medial nucleus consistently produced hyper- 
phagia. This observation is congruent 
with the hypothesis that a transection of 
components of the ventral noradrenergic 
bundle may be responsible for this effect. 
We also observed that the effects of such 
cuts on food intake and body weight are 
prominent and persistent only in female 
rats. Males showed marked hyperdipsia, 
but only more transient and relatively 
mild effects on food intake, which might 
easily escape detection in experiments that 
use male subjects exclusively, such as 
those reported by Paxinos and Bindra 
(1972). 

We (Sclafani, 1971 ; Sclafani & Grossman, 
1969) as well as a number of other investi- 
gators (e.g., Albert & Storlien, 1969; Gold, 
1970) have also observed hyperphagia 
following parasagittal knife cuts between 
the medial and lateral hypothalamus. It 
was originally assumed that this might be 
due to an interruption of direct inter- 
between the ventromedial 
region and adjacent parts of the lateral 
hypothalamus, which recent neuroanatomi- 
cal investigations had demonstrated (Arees 
& Mayer, 1967; Millhouse, 1973a, 1973b). 
However, a more detailed analysis of the 
histological materials (Gold, 1970; Sclafani, 
1971) has shown that the most effective 
cuts are anterior and dorsal to most of 
the ventromedial nucleus and anterior to 
the lateral feeding area itself, Gold, 
Quackenbush, and Kapatos (1972) have 
reported that a unilateral cut in this region 
(which produced only minor and transient 
changes in food intake) can be combined 
with a contralateral lesion in the region of 
the mammillary bodies or in the ventral 
tegmentum to produce persisting hyper- 
phagia. The possibility of a hypothalamic 
synapse is certainly not excluded by these 
observations, but they suggest that the 
same fiber system may be involved in 
hyperphagia-producing lesions and coronal 
cuts posterior to the ventromedial hypo- 


connections 
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thalamus and in the parasagittal cut stig 
lateral to it (as well as in a rp E 
area of the ventromedial hypothalamu 
Eo of the effects of coronal cuts 
through the anterior hypothalamus sug- 
gests that the dorsolateral projections to 
or from the ventromedial hypothalamus, 
which are responsible for the hyperphagia 
in these experiments, may ascend for some 
distancein the medial segment of the antero- 
lateral hypothalamus. Sclafani (1971), 
working in my laboratory, as well as others 
(Palka, Coyer, & Critchlow, 1969; Storlien 
& Albert, 1972) have found th 
coronal cuts immediately 
of the ventromedial nucle 
cantly modify food int 
I have observed transient hyperphagia and 
exaggerated weight gain after larger cuts 
in the same coronal plane, which extended 
into the medial aspects of the lateral hypo- 
thalamus (Grossman, 1971). Other investi- 
gations have subsequently reported moder- 
ate hyperphagia and increaseq Weight gain 
in male (Paxinos & Bindra, 1972) as well 
as female (Storlien & Albert, 1972) rats, 


after extensive coronal Cuts through the 
anteromedial hypothalamus, 


Hyperphagia has also 
following knife Cuts in 
plane, which effectively isol 
medial nucleus (and 
from the Paraventricular nucleus (p. 

c é a alka, 
Liebelt, & Critchlow, 1971). ese ien: 


vations are compatible with the hy i 
t oth 
that an Interruption dr iran 


ventral noradrenergi 
tribute to the h 


at small 
rostral to the tip 
us do not signifi- 
ake or body weight. 


been reported 
the horizontal 
ate the ventro- 
overlying regions) 


at i 
i amic lesions, 
may be particularly ignificant in vi 
paraventricul 
feeding e 
drenergic 


ar nucleus ma 
licited by microinj acti 
compounds, 

Although most experimental observations 
are compatible with this interpretation 
one should keep in mind that t e suggested 
relationship between satiety and noradren- 
ergic components of the medial hypo- 
thalamus is based 


almost entirely on 
inferences from possibly misleading Corre- 
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t 
lations. There is a general agreement E 
the ventral noradrenergic bundle pem 
the midbrain and projects mainly to ue 
portions of the diencephalon, but so E. 
Course and distribution are not ye ro- 
known. Moreover, the norarirenera' Eia 
jections are diffuse and overlap mp athe 
with a wide variety of other d € 
Ways that project to the media a tha 
thalamus, and it is by no means ae a on 
We are justified in focusing our attenti 
the monoaminergic projections. inder- 
We have only recently begun to t [ the 
stand the anatomical organization etail 
ventromedial hypothalamus in any aviorā 
and it is not yet clear how the beha an 
effects of lesions and knife cuts a ana 
around the area are related to - wa 
tomical data, Until very recently "hypo" 
believed that only the anterior t pro” 
thalamus provided significant afferen ^h 
jections to the ventromedial nucleus | d 
1970), and it was widely held m 
nucleus had very few if any efferen HT 
tions beyond its border (Kaelber & ing it 
1967; Sutin & Eager, 1969), oe one 
“enveloped in splendid isolation, 1 3a) 
reviewer recently put it (Millhouse, it 
Only within the last few years has of 
€monstrated that the outer shell, es ia 
nucleus (a) receives afferents from a E 
terminalis, the mammillary pedunc, eral 


: the later 
medial forebrain bundle, and the Jatef? 
ypothalamus ; 


hypothalamus, t 
the zona 
ypothalan 
ally into 
and medi 
& Wolf 3 
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HORMONAL INFLUENCES ox. HyPoTHAN 
YPERPHAGIA 

role of g 


of hypo 


che 


inPU gy 
at sex hormones " má" 
Other oJ d body weight, heir € " 
ie ero Servations Suggest that the! meg 
hay be Mediated jy hypothalam!C ip 
nisms. The nature of this relatio” 
as yet poorly Understood, 


Lo 


H 


HYPOTHALAMUS AND FOOD AND WATER INTAKE 


en Sne mels rat, castration decreases 
because pen ut increases body weight 
(Kakolewski, sharp decrease in activity 
Dieetin i Got & Valenstein, 1968). 
fond wes testosterone propionate return 
Hutchinson, T eon levels (Hervey & 
intake a d 973). In the female rat, food 
estrous eval activity covary with the 
Stramin bugs l (Brobeck, W heatland, & 
Pee. FR 947 ; Kennedy & Mitra, 1963); 
lewski an increases food intake (Kako- 
Of estradiol pe 1968); and injections 
(Zucker P Ta reverse this effect 
interesting a 1972). It is particularly 
ood tink iy hat estradiol does not modify 
abnormally em the animal is at an 
Nussbaum, high body weight (Redick, 
These n, & Mook, 1973; Zucker, 1972). 
that the n related observations suggest 
rain to veia hormones may act on the 
body uut um “set-point” at which 
Kenne En is regulated (e.g. Mook, 
n oberts, Nussbaum, & Rodier, 
Footage te 1972). Similar set-point 
the role d have been advanced to describe 
l'éitelbaun the ventromedial (Hoebel & 
cesey ie ae or lateral (Powley & 
Considerable ) hypothalamus, and there 1s 
acting on i evidence that steroids may be 
though hypothalamic mechanisms (al- 
Modify not exclusively so) in order to 
food intake. 

Mus (Re in the ventromedial hypothala- 
hyperpp. nnedy, 1957) do not produce 
they re; agia in immature female rats until 
Tats T puberty. Although prepuberal 
Phagic pie eerie adults) are hyper- 
relation m food intake is considered in 
the iun ae weight, the ineffectiveness 

© reflec h othalamic lesion does not appear 
Prépubo s ple ceiling effects, because 
“easing " rats are quite capable of in- 
pid (d. food intake when exposed to 
i he nd um, Cheng, & Rozin, 1969). 
Men repor t marked sex differences have 
Phenoman ed, but the generality of the 
Us assein is uncertain. Valenstein and 
1 ein, 1969 E (Cox, Kakolewski, & Valen- 
969) los alenstein, Cox, & Kakolewski, 
Ventro, Ve reviewed numerous reports © 
that ee hyperphagia and concluded 
€ effects of ventromedial hypo- 


213 


thalamic lesions on food intake and body 
weight were significantly larger and more 
readily obtainable in the female. A direct 
experimental test provided support for 
this hypothesis. The results of more recent 
investigations (e.g., Gold, 1970; Rehovsky 
& Wampler, 1972) indicate that this sex 
difference may not be as general a phe- 
nomenon as the earlier reports suggested, 
but the possibility of hormonal influences 
has not been ruled out. 

Ventromedial hypothalamic lesions dis- 
rupt female mating behavior (Kennedy & 
Mitra, 1963), and there is some controversy 
concerning their effects on gonadal func- 
tion. Valenstein et al. (1969) reported 
evidence of gonadal atrophy after ventro- 
medial lesions, but Wampler and Gier 
(1974) recently presented evidence suggest- 
ing that male as well as female gonads 
retain their capability to secrete steroid 
hormones after ventromedial hypothalamic 
lesions and may, in fact, be unaffected by 
conditions (such as constant illumination) 
that interfere with cyclic hormone release 
in the female rat. Ventromedial hypo- 
thalamic lesions increase the release of 
prolactin, which has been shown to increase 
food intake in the female, but not in the 
male (Pfaff, 1969). The ventromedial 
hypothalamus is most directly implicated 
by Wade and Zucker's (1970) report that 
implantation of estradiol into that area, 
but not into adjacent regions, decreased 
food intake within 12 hours in spayed 
female rats. Similar observations were 
recently reported by Jankowiak and Stern 
(1974). 

Direct comparis 
of ovariectomy an 


ons between the effects 
d ventromedial hypo- 
thalamic lesions indicate some remarkably 
detailed similarities (e.g both increase 
meal size in the light as well as in the dark, 
but increase meal frequency in the light 
only [Balagura & Devenport, 1970; Kenney 
& Mook, 1974]) but also some clear 
differences (e.g., lesions increase finickiness, 
while ovariectomy has opposite effects 
[Teitelbaum, 1955; Zucker, 1969 ]. 
These observations are congruent with 
the hypothesis that estradiol may affect 
food intake by acting on components of 
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the ventromedial hypothalamus. However, 
others indicate that there must be other 
sites of action and that the effects of lesions 
in this area cannot be entirely due to a 
loss of normal steroid influences on food 
intake. The first of these reservations is 
suggested by the fact that ovariectomy 
increases, and subsequent hormone treat- 
ments decrease, food intake in rats that 
are hyperphagic as a result of ventromedial 
hypothalamic lesions (King & Cox, 1973; 
Montemurro, 1971). Nor does the critical 
site of action seem to lie in projections to the 
lateral hypothalamus, as one might con- 
clude from the observation that females 
show a permanently lowered body weight 
set-point after lateral hypothalamic lesions 
(Powley & Keesey, 1970), whereas males 

do not (Cox & Kakolewski, 1970). This is 
indicated by the fact that ovaries exercise 
a restraining effect on body weight in rats 
that have recovered from the effects of 

lateral hypothalamic lesions (Cox & Kako- 

lewski, 1970; Harrell & Balagura, 1975). 

The second point—that lesions in the 

ventromedial hypothalamus do not merely 

prevent the action of steroid hormones—is 

indicated by the fact that the effects of 

E k Severe and persistent 

iode *- er ovariectomy, A 

0 € in body weight is commonly 


Seen after lesions, whereas ovariectomy 


in an increase of 
(Mook et al., he E 


ment against su int i i 
further Re oi by i 
hyperphagia that results from vent E 
medial hypothalamic lesions is ap da zc 
E Ww y by hypophysectomy on 

t al., ut is ma . 
subdiaphragmatic ven Pu 1 
Opsahl, 1974), ils 

It is clear that we 


do not yet und 
a ersta 
the etiology of the hypothalamic os 


phagia, but it appears likely that ; 
may have several independent MD 
Some observations from my own Nine. 
tory, as well as others, Suggest that some 
of the contri buting factors can be isolated 
by recently developed neuroanato 


y c mical or 
neurochemical techniques, 
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Hennessy and I (unpublished observa- 
tion, 1974) have recently completed a 
detailed investigation of the effects O 
coronal knife cuts in the posterior hypo- 
thalamus and have found that the hyper- 
phagia that reliably occurs after some © 
these cuts differs in some critical respects 
from that seen after electrolytic lesions in 
the ventromedial hypothalamus. Female 
rats displayed a pronounced hyperphagta: 
but the effect was relatively short-live 
(5-8 weeks) and rarely resulted in a tota 
increase in body weight in excess of about 
25%. Most male rats overate only slightly» 
if at all, shortly after surgery but displaye* 
à dramatic hyperdipsia (the animals co™ 
sumed 200-300 ml per day) that persistec 
unabated for many months. ‘These anima 1 
had renal dysfunctions similar to those Ww 
in diabetes insipidus (even though the kni 
cuts did not involve the infundibular prenn 
Suggesting that the excessive water int? E 
may have been related to an interferen? 
with antidiuretic hormone release DY e 
Pituitary (Grossman, Hennessy, & a 
unpublished Observation, 1974). In man) 
animals the excessive water intake Kt 
however, entirely feeding related, and Wê É 
ponsumption dropped to normal or eve? a 
int normal levels during periods of € 
“privation, Suggesting that other fact ia 
ud have contributed to the hyperdiP? 
sore: and I (unpublished observa sia 
:914) have recently observed hypere!P^ | 
In animals with coronal cuts in the VC? 
del mesencephalon, several millimct t 
"- the infundibulum. We have 
2-. tested kidney functions in | 
cut — e Piper unlikely tha none 
nds interfere with pituitary ^ that a 

the. st thus appears possible ! Aat 
pathway that normally exerts inh! ae 

on water intake may the 


t 
hrough ventromedial aspects © 


brainstem, 

_ It is interes 
tion, that th 
chemical 


ting to note, in this con y 
* hyperphagia produce jor? 
and electrolytic brainstent : jg 
“rupt the ventral noradrem oy) 
Ahlskog, Hoebel, & Breisch» r 


the feedi «e nO 
ephri Ing r > tO 
n ine i 8g response 


Pjections into the region ° 


bundle ( 
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aget ined nucleus (Leibowitz, Note 
The ot gren by hypophysectomy. 
Bie ventral = hoe studies implicating 
entethalie oradrenergic bundle and its 
of food ee in the regulation 
in this ae particularly interesting 
components - because noradrenergic 
been fetpfies : the hypothalamus have 
B ertonez m in the release of steroid 
brain fiis or instance, steroids alter 
s Cei: metabolism (Anton-Tay 
hypothalamic 1971; W urtman, 1971); 
With the ea norepinephrine levels vary 
1967); and vies cycle (Stefano & Donoso, 
and o wc ng levels, synthesis, 
Crease afte r have been shown to 1n- 
Anton & er ovariectomy (Anton-Tay, 
elham & oo 1973; Anton-Tay, 
urtman be 1969; Anton-Tay & 
More A Bapna, Neff, & Costa, 
IScussion S irectly to the point under 
tave e p m and DiCara (1973) 
tion of the i that noradrenergic stimula- 
ood inter ntenor hypothalamus increased 
levels. Mis regardless of plasma estradiol 
ceding Sahat dopamine injections elicited 
"rom this y er estradiol titers were low. 
Son and Dic related observations, Simp- 
gestion th; 'Cara (1973) derived the sug- 
inhibit nk high levels of estradiol might 
c Ce by ME hypothalamic 
iu the roxylase, the enzyme 
tom do s: sy nthesis of norepineph- 
Observation amine. Some experimen- 
3) are not a (eg., Stern & Zwick, 
n interpret, readily compatible with such 
iin of th ation, but the possibility of a 
Ne hypog i noradrenergic components of 
ated in „lamus, which have been impli- 
ot investigatio by numerous other lines 
E h the infin n discussed in this article, 
fare t"iguin nenge of steroid hormones is 
her a possibility, which deserves 
ention. 


VIDE 
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N 

Ow t] 
: li i 
at we have reviewed some of the 


en 
ot ges questions the unique role 
ask ive portalams in the regulation of 
of 4 at evi avior, it seems only fair to 
the p. S Vidence might be cited in support 
Ypothesis that cellular components 
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in the region of the postulated feeding and 
satiety centers may, after all, contribute 
significantly to the regulation of food or 
water intake. 

Electrical stimulation in the hypothala- 
mus has been shown to elicit feeding or 
drinking (Andersson & Wyrwicka, 1957; 
Miller, 1957; Mogenson & Stevenson, 
1967), as well as hunger- and thirst- 
motivated instrumental behaviors (Anders- 
son & Wyrwicka, 1957; Coons, Levak, & 
Miller, 1965; Miller, 1963). The behavioral 
specificity of these elicited behaviors has 
been questioned (Cox et al, 1969; see 
Mogenson, 1973; Teitelbaum, 1973; Valen- 
stein, 1973, for reviews of the current 
status of this issue), but even if one 
accepts the view, favored by most investi- 

this field, that anatomically 


gators in 
distinct hypothalamic mechanisms exist for 
the regulation of food and water intake, 


nothing indicates that the effects of elec- 
trical stimulation are due to an activation 
of cellular components of the region rather 
than fibers of passage. 

Many years ago, I 
evidence for feeding- 
neurons in the hypot 
the behavioral effects of 
of putative transmitters, whicl 
selectively on postsynaptic cellular. mem- 
branes (Grossman, 1960, 1962a, 1962b). 
These experiments demonstrated that in- 
jections of norepinephrine and related 
compounds into the hypothalamus elicited 
feeding and inhibited drinking, whereas 
injections of acetycholine and related 
synthetic cholinomimetics elicited drinking 
and inhibited feeding. These observations 
have been repeated and extended by 
others (e.g., Booth, 1967; Fisher & Coury, 
1962; Leibowitz, 1970; Margules, 1970; 
Margules, Lewis, Dragovich, & Margules, 
1972). It now appears that adrenergic 
compounds may not only elicit feeding but 
also have inhibitory effects on intake 
(Margules, 1970), depending, perhaps, on 
the injection site (Leibowitz, 1970, 1971), 
taste factors (Margules & Dragovitch, 
1973), or time of day 


(Margules et al., 
1972). But is this persu: 


attempted to obtain 
and drinking-related 
halamus by observing 
microinjections 
| should act 


asive evidence for 


the existence of neurons which are specific- 


216 


ally related to feeding, drinking, or satiety 
m. "pothalamus ? ee 
bol can be elicited by injections 
of abolincmsneiien into ang sites 
outside the hypothalamus (Fisher & 20 
1962; Grossman, 1964a, 1964b), ee des 
effect can be blocked by injections " pe 
cholinergic agents, such as ey m 
many widely distributed sites ia the p 
system (Levitt & Fisher, 1966). i e 
ubiquity of the effective placements as 
suggested to some investigators that the 
compounds might diffuse toa single site 
of action, perhaps via the ventricles 
(Routtenberg, 1967). Autoradiographic 
data on the rate and extent of diffusion of 
atropine from an injection site (Grossman 
& Stumpf, 1969) are not readily 
with such an interpretation, but the 
available data do not rule out the hy- 
pothesis, because effective diffusion and the 
effects of transport via the vascular system 
are not well understood at this time. The 
possibility that the high concentrations 
used in these experiments May produce 
nonspecific effects on fibers of passage has 
also not been ruled out. 

My original investig. 
ergic feeding effect sug: 
effective injections 
fornical region of 


congruent 


ation of the adren- 
gested that the most 
Involved | the peri- 


: the hypothalamus, but 
active sites were also seen rostral, dorsal, 


and medial to that area (Grossman, 1962a), 
Booth (1967) has reported a Concentration 
of positive points in the dorsal Tegions of 
the anterior hypothalamus and basal thala- 
mus and has Suggested that Projections 
from the Stria medullaris might be re- 
sponsible for the effect, Leibowitz (1970) 
noted that most Positive inject; 
were concentrated in the 
thalamus and has recently ( 
ally implicated the paraver 

and the Supraoptic nucleus. 
Sponses to alpha 

have not been repor 


adrenergic Compounds 
in the dorsol 


ted from injection Sites 
ateral hypothalamus 


. : y » Where 
lesions result in persistent aphagia and 
adipsia. It may be Significant, in this 
context, that the inhib 


itory effects that 
associates ( 
& Dragovich, 


Margules and his Margules, 
1970; Margules 


1973; Mar- 
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755 ere 
gules et al, 1972) have ahserved, M 
obtained from cannula placements 
lateral hypothalamus. . — 
On balance, these observations are neph- 
patible with the notion that pese 
rine may act as an inhibitory ique. 
on cellular components of the m (i.e 
hypothalamus that exert uae hs ail 
satiety) influences on feeding. RE 
interpretation is congruent with the we 
literature, but it is difficult to comp drug 
rule out the Possibility of nonenecine TH 
effects that might involve fibers of pe even 
rather than postsynaptic membranes, Ep 
though recent investigations have popu 
in eliciting (Leibowitz, Note 3) Or ding 
tating (Ritter & Epstein, Note c rine: 
with Nanogram doses of ap tea mn 
which begin to approximate physio 
quantities, adre 
The Strongest evidence that cdi pla 
ergic Components of the y gener i 
an important role in the regulate has 
food intake, at least in the monkey 913): 
been reported by Yaksh and Myers invest!” 
sing a Push-pull cannula, these i 


pe 
-. 
Sators found that a perfusate 


E ot non^" 
hypothalamus of a food-deprived ! ae 
Would elicit feedir 


ig when injected int? 
hypothalamus of a sated animal iem of ? 
a perfusate from the hypothalamus in ? 
Sated monkey would inhibit bearing ar 
deprived one, These observatio gt t 
Particularly significant in the ge tha 
this discussion, in view of the ae ne 
Perfusates from hungry monkeys Go rin’ 
an increased quantity of a norepineP™ o 


à nS 
Ixe substance and that apr ye 
exogenous norepinephrine into th fort e 
thalamus also elicited feeding. pat, 
nately, these intriguing experiment? pie) 
not yet bee 1 
L 


J95 "m 
are other experimental oi ws 
ich support the hypothe” ial” 
Ponents of the hypot it m 

and water intake PY Th 
alternativo explanations: coy 


com 


Systemic adm 


: : ose pe oj 
“Ic compound goldthiogltct is ald i 
centrates in the medial hypothalam^ y 
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lesi 
Devel, sor bye erphagia (Mayer & 
be insulin ee The effect appears to 
rom, & — (Debons, Krimsky, 
interpreted t outier, 1969) and is often 
the ventrom mee the hypothesis that 
receptors io ra SATOR deer eder qo gluco- 
oe organism. Thi the nutritional state of 
explanation i his is certainly a plausible 
Suggest that xut a number of observations 
instance ibd a is not compelling. For 
(and i s thioglucose also concentrates 
of the brain (s lesions) in many other parts 
Teview), sug see Liebelt & Perry, 1967, for 
etermined peni that uptake may be 
rather than 7 PRA metabolic activity 
Moreover specific glucoreceptor functions. 
elicit leni menin which is thought to 
Or more Specii by decreasing blood sugar, 
y the hy et e the utilization of sugar 
Unues "ig agg glucoreceptors, con- 
xtensive ie feeding in rats with 
lalamus (Epst in the ventromedial hypo- 
he hama ni & Teitelbaum, 1967). 
an Phi = of central glucoreceptors 
inis qe by recent observations, 
acellular : that feeding responses to 
Y lesions i g ucoprivation are abolished 
( as & K in the anterior hypothalamus 
& Stee Sraly, 1974), zona incerta (Walsh 
974), imd cat unpublished observation, 
STossmay nid brain tegmentum (Alheid & 
that do s unpublished observation, 1974) 
eal PUN ad-libitum food intake. 
Dude MAR for the presence of 
Comes fy ated cells in the hypothalamus 
of single om investigations of the activity 
1973. e ron in the region (see Oomura, 
Cistinguish review). This technique can 
Tom tho h potentials generated by cells 
and ( tas Propagated by fibers of passage 
our ee evidence directly related 
19 Nand, eu Earlier investigations 
; Bi Sharma, Dua, & Singh, 
ura, ju Banerfem & Anand, 1968 ; 
mhiyoshj ioe Ooyama, Maeno, Iki, & 
atic chan 64) reported apparently syste- 
anges in single cell activity in the 


intr 


Cedin 


Entr 
Omedi 

fo owing al and lateral hypothalamus 
Ycemi Systemically induced  hyper- 


ao p 
“vation r hypoglycemia, cellular gluco- 
"ece d or stimulation of gastric stretch 
s. » 
These observations lend some 
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support to the hypothesis that gluco- 
receptors and receptors sensitive to disten- 
sion of the stomach may be present in 
the hypothalamus but also admit other 
plausible interpretations, because the treat- 
ments were, in most cases, quite extreme 
(e.g. the hypothalamus has been implicated 
in "affective" reactions to noxious stimu- 
lation, and the observed changes in cellular 
activity may well have been related to 
such reactions). 

More recently, insulin and glucose have 
been applied iontophoretically to individual 
cells in the ventromedial and lateral hypo- 
thalamus (e.g., Oomura, Ooyama, Yama- 
moto, Ono, & Kobayashi, 1969), and cells 
have been found in either location which 
increased or decreased their firing rate in 
response to such treatments. It is possible 
that the selective activation of a cell by 
glucose or insulin may not reflect reactions 
peculiar to feeding-related glucoreceptors, 
but these experiments do avoid some of the 
ambiguities inherent in the earlier studies. 

Also in harmony with the hypothalamic 
model of food intake regulation is the 
observation that a reciprocal relationship 
appears to characterize at least some cells 
in the medial and lateral hypothalamus. 
Although the number of such concurrent 
observations is not large, and puzzl- 
ing negative results have been published 
(Murphy & Renaud, 1968, 1969), it may 


be significant that an increase in the firing 
rate of a particular cell in the lateral 
hypothalamus is, at least under some 
circumstances, accompanied by a simul- 
taneous shift to slower discharge rates of a 
cell in the ventromedial region (Oomura, 
Oovama, Yamamoto, & Naka, 1967) and 
that electrical stimulation of one of these 
areas can inhibit the activity of a cell in 
the other (Van Atta & Sutin, 1971). 
What is missing, at this point, is a clear 
relationship between these phenomena and 
food deprivation, feeding, or satiety. Re- 
ciprocal changes in single cell activity have 
been seen mainly during various sleep 
stages and during the transition from sleep 
to wakefulness or from drowsiness to 
arousal, and their relationship to feeding 


is tenuous at best. 
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CONCLUSIONS 


Where, then, do we stand in our under- 
standing of the hypothalamic mechanisms 
believed to regulate food and water intake? 
It has been the thesis of the present review 
that the fiber systems that course through 
the hypothalamus may play a more im- 
portant role in the etiology of the dysfunc- 
tons in food and water intake that are 
seen after hypothalamic lesions and stimu- 
lation than the widely accepted model of 
hypothalamic regulation implies. These 
considerations do not compel the interpre- 
tation that cellular components of the 
hypothalamus may not exercise important 
and possibly essential regulatory functions, 
but they do suggest that a re-examination 
of widely held and possibly unfounded 
preconceptions is in order. 

Although the picture that has emerged 
Írom recent investigations is incomplete, 
puzzling, and in some instances not 
entirely consistent, it appears likely that 
the aphagia syndrome, which charac- 
terizes rats with lateral hypothalamic 

lesions, may be due entirely or in part to an 
interruption of interconnections between 
the striatum and lower 
cluding the dopaminergic nigrostriatal pro- 
jections, which have no known synapse in 
the lateral hypothalamus. The functional 

importance of the regulatory deficit (i.e., 

lack of response to cellular glucoprivation), 

E persists after recovery of voluntary 

mring has been questioned by the results 

appears increasing magis Deae 
initial taa possible that the 

arment in feeding may þe the 
result of sensorimotor disturbances rather 
than a reflection of a disruption of moti- 
vational mechanisms specifically 


brainstem, in- 


j rela 
eating. uds 
The effects of lateral | 
q a Typothalami 
lesions on water intake y Er 


appear to be due 
ssues in the zona 
g adipsia is not 
or motor dysfunc- 


to the destruction of ti 
incerta. The resultin: 
explained by sensory 
tions, since feeding-related drinking occurs 
in animals that are entirely adipsic during 
periods of food deprivatio; a 


1 and fail to 
respond to cellular dehydration or extra- 
cellular hypovolemia, There jg Bome 
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evidence that the zona incerta and lateral 
preoptic region may contain recepton 
which may “meter” the state of celta 
and perhaps extracellular hydration. T t 
role of fiber systems that course throug? 
the zona incerta and adjacent subthalamic 
region remains to be elucidated. . a 
The results of many recent experimen 
indicate that the overeating and obesity 
seen after medial hypothalamic damage 
may be related at least in part to noradrent 
ergic pathways, which may synapse 9 
cells in the area. 
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A Schema Theory of Discrete Motor Skill Learning 


5 Richard A. Schmidt 
epartment of Physical Education, University of Southern California 


A number of closed-loop postulations to expl 
have appeared recently, 
logical problems in explaining the phenomena or 


performance phenomena 
suffers from either (a) 
(b) predictions that 
these difficulties are 
proposed that seems capa 
is based on the notion o 
duce movemen 
Some of the predictions a 
that can be used to test t 
the theory are presented. 


b field of motor behavior has become 
Med interesting in the past few years, 
Cisne ly as a result of some rather severe 
ne ges in the way in which researchers 

ve tackled their problems. Prior to about 
din th area seemed to be dominated by 
This ask-oriented" approach (Pew, 1974). 

$ approach emphasized "global" motor 


i een 
earning theories, such as that of Hull 


(1943), or emphasized no theories at all, 


an z 
me the area was dominated by experiment- 
nur Who were testing the effects of a large 
ied of independent variables on the 
RM learning and performance of motor 
eol Scoring was usually in terms of some 
on Y gross index of responding, such as time 
on target for 30 sec, and there was little 
cern for the events that changed within 
Pro leagues who 
2 ged many useful comments on an earlier draft 
Ron d paper: Jack A. Adams, Sheila Henderson, 
` ae G. Marteniuk, Karl M. Newell, Richard 
e €w, and H. T. A. Whiting. 
` ests for reprints should be si 
nig chmidt, Department of Physica 
alio sity of Southern California, Los Angeles, 
Ornia 90007. 


A ors 
Dpreciation is extended to my co! 


sen 


are not supported by 
discussed, a new theory for discrete moto; 
ble of explaining the existing findings. 
f the schema and uses a recall memory to pro- 
t and a recognition memory 


re mentioned, 
he predictions are listed, and data in support of 


ain motor skills learning and 
but each of these views 


the empirical evidence. After 
r learning is 
The theory 


to evaluate response correctness. 
research techniques and paradigms 


the individual that enabled him to perform 
or to learn the motor task. 

Since 1960, however, there has been a 
considerable shift in emphasis in motor skills 
research. Motor behaviorists have begun to 
ask questions about the kinds of processes 
occurring as the individual performs and 
learns the motor response. The tasks used 
have tended to shift from those that could 
only be scored with global measures to those 
that enabled the isolation of various pro- 
cesses and strategies and provided informa- 
tion about contributions of various subsys- 
tems. A number of researchers took the 
lead from the new directions provided by 
Fitts and his colleagues (e.g. Fitts, 1954) 
concerning the processing of information in 
skills. Examples of these new concerns were 
the time to process visual information (Keele 
& Posner, 1968), the development of error 
detection mechanisms (e.g Schmidt & 
White, 1972), and the locus of attentional 
requirements in simple movements (Ells, 
1969). Questions such as these were not 
popular under the earlier traditions. 
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A part of the reason for this shift in em- 
phasis has been the inadequacy oí the earlier 
theories in providing reasonable explana- 
tons for motor learning and performance; 
this is well documented by the controversy 
about Hull’s (1943) theory that raged in 
the late 1950s and early 1960s (see Adams, 
1964, for a review) and finally resulted in 
an almost total loss of interest in these points 
of view. The lack of adequate theories for 
guiding research in motor behavior caused 
motor behaviorists to reach out in other di- 
rections for explanations of motor learning. 
Of course, an important event in a number 
of fields was the arrival of the information 
processing and cybernetic ideas that led to 
closed-loop theory, and the application of 
these notions to psychology and motor skills 
was welcomed with great vigor. 

Of the closed-loop theories, some were 
quite old, for example, Bernstein (1967) in 
essays originally published in 1934 and 1957 
but there have been a number of widely dis- 
cussed recent additions. The closed-loop 
models of Sokolov (1969), Anokhin (1969) 
and Konorski (1967), plus t : 
Adams (1971) 


an error, ar 
Subsequently corrected, 


5 many closed-loo 


à theori 
were, it w; f aor m - 


as surprising that th 

t ere w 
few serious attempts to test them » m 
another experimentally. It is i. 
the theories did 


among the workers 
ideas have been u 


various experiment: 
authors. Rather, it is 
theoretical ideas are not strict] 
are more properly term 

man, 1960, for M scies ds bus se 
between models and theories). 
models are very easy to construct 
ingly all one needs to propose a né 
Is a few appropriately labeled p 


there 
re so 
st one 
not because 


as Seem- 
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posedly representing processes in motor per- 
formance, with connecting arrows, and a 
new model is born. What has often resulted 
are models that are not tied to experimental 
data for their formation and that do not 
provide ways for conducting tests of theng 
The result is a diagram that is useful R 
haps in visualizing what might be pom 
in skills and learning, but which has lit 
basis in fact and cannot be verified (or, p 
properly, disproved) by experiment. E 
kind of model building has been taken. i 
too seriously by students of motor behav 
A notable exception to this statement "d 
the ‘closed-loop theory proposed in 1971 A 
Adams. This theory was generated a 
way quite different from that of the T cid 
closed-loop ideas, First, the original er 
was an existing body of carefully cnn a 
basic research in motor learning—that dr 
ing with the learning of slow, graded, n 
positioning tasks. Focusing on these d 
Adams attempted to explain the vari 


t ions 
ndings with the aid of closed-loop notis 


providing operatiot 
his constructs, by sug eid 
experimenta] variables that, when apP! Pn 
ately manipulated, should produce cert 

changes in particular measures, and 
Sesting experimental paradigms f 
various aspects of the theory. The ree 
of w 
the Appearance of approximately 2 
testing the Adams theory in the e 

Years after it was introduced—already » 
than for all of the earlier closed-loop 
Orists combined, Clearly, the produc : 
a theory that was so easily testable y 
Searchers was appealing, and the te for 
theory became a most important are i 


the field of motor learning in a very 
time. 


Tue Apams THEORY 
Adams’ (1971) theory proposes aot 
are two states of memory, termec s. Tu 
ory trace and the perceptual ope ji 
memory trace, analogous to recall 7 jot 
m verba] learning, is a “modest g n 
Program” responsible initiati’ 


ther? 


for 
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movement, choosing its initial direction, and 
determining the earliest portions of the 
movement. Its strength is developed as a 
function of knowledge of results (KR) and 
practice. The perceptual trace, on the other 
hand, is analogous to recognition memory in 
verbal tasks and is responsible for guiding 
the limb to the correct location along the 
trackway. The perceptual trace is formed 
from the past experience with feedback from 
earlier responses and comes to represent the 
sensory consequences of the limb being at 
the correct endpoint. During the movement, 
the subject compares the incoming feedback 
(from the eyes, ears, proprioceptors, etc.) 
against the perceptual trace to determine if 
the limb is in the correct final location; if 
it is, he stops responding, and if it is not 
(ie, an error is signaled from the percep- 
tual trace), he makes a small adjustment 
and the comparison is made again until the 
limb is in the correct location. With in- 
Creased exposure to feedback and KR, the 
Perceptual trace is strengthened, and the in- 
dividual becomes more accurate and confi- 
dent in his responding. 


Some Strengths of Adams’ Theory 


. Adams' theory has a number of character- 
istics that are generally considered desirable 
attributes, Some of these, in each case rep- 
Tesenting improvements over early closed- 
loop theories, are as follows. 

Concern for learning. Almost every one 


of the early closed-loop theories has dealt 
With the processes thought to occur in the 
red skills. In 


Performance of already-acqui 
Addition to this concern, the Adams theory 
1S directed toward the learning of novel 
Motor tasks, which was not a consideration 
In the earliest work. 
Reduced scope. Adams has begun mod- 
€stly, attempting to limit his theoretical pre- 
dictions to the body of data dealing with the 
learning of linear positioning. Thus, the 
theory is very close to the data and attempts 
to explain them with the theoretical con- 
Structs proposed. To be sure, the theory 
Probably does have applicability to response 
Classes other than linear positioning, but a 
limited focus provides the most solid begin- 
nings for further theoretical developments. 


Empirical support for constructs. The 
processes postulated in Adams' theory are 
all known processes, in that there is con- 
siderable evidence, either directly from 
motor behavior or inferred from other re- 
sponse classes, for each. This has not been 
the case for many of the earlier positions, 
in which processes were proposed without 
either behavioral or neurological support. 
Adams’ approach has been to insist that the 
model postulate only those kinds of pro- 
cesses or mechanisms that have a reasonable 
probability of empirical reality. 

Simplicity. The Adams theory is quite 
simple, postulating a minimum of hypothet- 
ical states to account for the learning of 
positioning tasks. Some of the earlier mod- 
els are extremely complex, with many times 
more states postulated, and it is encouraging 
to think that motor learning might be ex- 
plained by fewer postulations than seemed 


necessary before. 


Some Criticisms of Adams’ Theory 


With all of the research and thinking that 
has developed since Adams’ theory was pub- 
lished, it is not surprising that some short- 
comings in the theory have appeared. Some 
of these result from logical difficulties with 
the theory as stated, and others result from 
recent data that do not follow the theory’s 
predictions. 

Limitations to positioning responses. Al- 
though this is not a serious criticism, some 
researchers are worried that positioning re- 
sponses are not representative of the wide 
range of behavior that one would like to 
term skilled, and that the theory cannot ex- 


plain the learning of other types of res 
iterated that the 


sponses. It should be reiterati 
theory was deliberately limited in scope be- 
cause of the lack of good evidence in other 
types of responses, but it would be a desir- 
able goal to have a theory that predicts per- 
formance and learning in other tasks as well. 
Specifically, the generalization to more rapid 
responses (€.g., kicking and throwing) would 
be important, as would attention to open and 
closed skills or to skills in which the ma- 
other than accuracy (e.g. pole 
The author and his colleagues 


& White, 1972; Schmidt & Wris- 


jor goal is 
vaulting). 
(Schmidt 
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berg, 1973) have attempted to extend 
Adams' theory to simple accuracy tasks re- 
quiring rapid movement (approximately 30 
cm in 150-200 msec), but the theory was 
not specifically designed to handle these 
tasks and tests using such tasks are not 
strictly tests of Adams’ point of view. Some 
revision seems clearly necessary in order to 
extend the theory to more rapid movements, 
The error detection mechanism, 
jor feature of Adams' theory is that it pro- 
vided a means for the subject to determine, 
in the absence of KR, his error for the re- 
sponse just produced, to use thi 
tion (termed subjective reinforce. 
means for maintaining performa; 
to continue to learn without RR | Thean 
thor has argued before (Schmidt, 1974, 
Schmidt, Note 1) that learnin 
does not follow logically fro 
ory. Theoretically, 
perceptual trace and 


The ma- 


s informa- 
ment) as a 


the subject has removed hi 


lever, how can he Senerate an additional] 


cannot. When asked w 


that trial, the subject 


enable the in 

without KR. 
In a stud i 

published pid i972) dt and edt (an- 
10 positioning trials, with 

trial A Paradigm was Used į 

subject responded, a 


was asked 
— Á— B9 
1 Requests for information 
^ con i H 
other unpublished studies cited i this 4n this anq 
be sent to Richard A. Schmidt, pe, le should 
hysical Education, University of rtment of 


fornia, Los Angeles, California 90007 ern Cali- 


Orcement that 


1 would 
to contin ue to 


learn 
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thought his error was (in the units i ss 
Positioning task), and was then provided 
KR. The measure of the error detection 
mechanism was the correlation, computed 
within Subjects, between the actual and 
judged error on the last 20 trials. The high- 
est correlation was .40, and many of -. 
were negative, with the average being an 
Had there been a strong error detection 
mechanism Operating after the movement, 
larger correlations, similar to those found a 
Schmidt and White (1972) for a rapi 
movement task (7 = .90), should have been 
found. The Schmidt and Russell owes d 
agree with the prediction that there uq 
be no error detection mechanism after t 
movement in slow positioning. tan BIA 
earning without KR. Another ge 
tant prediction of the Adams theory 1S gt 
later in learning, when the perceptual es 
is well established, the subject should be 25^ 
to continue to learn without KR. There 5$ 
no evidence that individuals can continue ki 
sun Positioning tasks after KR Lemon 
and the best approximation seems to be bl f 
they can maintain performance reasona y 
well after KR withdrawal (e.g., Bilodeau à 
Bilodeau, 1958). But in spite of the ian " 
evidence, the theory cannot predict mus ia 
learning, Tf, on the first no-KR al 
the movement is not perfect, the perceptus 
trace will be degraded somewhat because N 
the feedback from that response. The E 
response will pe less accurate because ed, 
Perceptual trace h wie e 
and the perceptual trace will be degra er- 
further, and so on. Eventually, the an 
Ceptual trace becomes increasingly jae 
with a Corresponding decrement in em 
and Performance is increasingly inaccur® 
early, the theory cannot predict lea 
without KR for positioning responses. while 
eneralization to rapid responses. : 
the Adams theory was not intended abe 
explanatory for rapid responses, a rt 
of experimenters (e.g., Newell, 1974; N^ 
Chew, 1974; Schmidt & White, 
chmidt & Wrisberg, 1973) have 
Adams’ closed-loop ideas to generate P 
tions for this response class, The reaS. ine 
Was that in slow positioning responso ue 
Memory trace and Perceptual trace 


as been slightly weal 


rnin£ 


Jj 


. 
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experimentally confounded, with positioning 
tasks not providing measures that could be 
ascribed to the variations in the memory- 
trace strength. One solution was to use 
rapid movements, usually the movement of 
a slide for 10-15 in. (254-381 cm) with 
a goal of 150-200 msec, where the move- 
ment would be carried out by the memory 
trace (in this case a motor program without 
feedback involvement), with the perceptual 
trace strength evidenced by how accurately 
the Subject could guess his movement time 
after the movement. 
Many of the predictions that seemed to 
come from Adams’ theory were supported. 
‘he strength of the error detection mecha- 
nism (the perceptual trace measured as the 
Within-subject correlation between actual and 
Judged error) increased as a result of prac- 
tice (Schmidt & White, 1972), the error 
detection mechanism was sensitive to the 
amount of experience with feedback stimuli 
(visual and auditory) present in the task 
Newell & Chew, 1974; Schmidt & Wris- 
erg, 1973), and ratings of the subject's 
Confidence increased with practice as well. 
ut there were some instances of nonsup- 
Port. Shea (unpublished study, 1972) failed 
to confirm that the interresponse interval 
Was a factor in development of either the 
Memory trace or the perceptual trace. Also, 
Adams states clearly that the delay of KR 
Should not be a factor in the development of 
N€ perceptual trace, and yet a study by 
Schmidt, Christenson, and Rogers (in press) 
indicates that delaying KR by 25 sec versus 
Sec caused a drop in the correlation (from 
A2 to .61) between actual and judged error; 
* interpretation was that KR delay i 
farded the development of the perceptua 
race, " 
Perceptual trace development. Adams 
theory holds that the perceptual trace 15 
°rmed from feedback traces associated with 
ing moved to the correct location, and 


at without this experience at pere 
OCatior trace cannot dev ; 
n, the perceptual trac ) had sub- 


Williams and Rodney (in press 

Jects practice a linear positioning task under 
Variety of conditions, and two of these 

Conditions test this prediction. Group is 
target only) moved 16 times to a stop tha 


defined the criterion location, while Group 
IR (interpolated random) moved to 16 ran- 
domly ordered stops (not at the criterion 
location), after being told that the criterion 
location was in the center of these positions. 
When subjects then attempted to move to 
the criterion position without the stop for 
20 trials without KR, the absolute errors 
(see Figure 1) for the two groups were 
nearly equal on the first block of 4 trials; 
also Group IR maintained performance over 
blocks, whereas Group TO regressed sig- 
nificantly. The finding that subjects who 
had never experienced the correct location 
(Group IR) could move as accurately (and 
maintain accuracy more effectively) as those 
subjects who had the conditions necessary 
for the development of the perceptual trace 
(Group TO) is very damaging to Adams’ 
position, A 

The storage problem. Most existing the- 
ories, whether they stress the open-loop 
(programmed) aspects (e.g, Henry, 1960) 
or the closed-loop aspects of movement con- 
trol (e.g, Adams, 1971), implicitly postu- 
late that for each movement that is to be 
made there must be either a motor program 
or a reference against which to compare 
feedback (depending upon the type of the- 
ory), and that there is a one-to-one mapping 
between stored states (either programs Or 
feedback states) and movements to be made. 
This presents problems for the central ner- 
vous system in terms of the amount of ma- 
terial that must be stored, as discussed 
recently by MacNeilage and Macaus 
(1973) in the speech production an 
by Schmidt (in press) for motor a 
They estimated that for the English : 

age, considering inflections and accen S, 
im were some 100,000 phonemes required, 
and thus the same number of stored d 
When we add to this the nearly countless 
additional ways (other than speak m 
which individuals move their muscu "€ 
the individual must have a nearly count et 
supply of either programs or feedback be es 
in storage. While this may be poss! - 
and there is no evidence that it 1s not w- 
ble—it would seem desirable “fae x 
mechanisms that do not require this level o 

$ ly, a number of attempts at 

storage. Recently, 
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alternatives to the earlier Program or feed- 
back theories have been Proposed that do 
not require this volume of material to be 
stored, such as the MacNeilage (1970) tar- 


get hypothesis and the use of reflexes by 
Easton (1972). 


a "novel" move. 
to investigators 


example, Bartlett 
tein (1967 in 


There- 
able to 
nove] 
ngs of 


movements to overcom 


€ the shortcomi; 
existing theories in thi 


8 regard. 


om Williams & Rodney, in press). Ab- 
interpolated random group. 


THE Goat or THIS ARTICLE 


, 

1s 

The research and thinking that ey 

(1971) theory, and other pum few 

Ories as well, have generated in the omings 

years has revealed a number of tai in 
—both logical and empirical—as out 


res- 
: 2 a ne 

the previous Sections. The goal of the P the 
ent article is 


a theory that un con- 
Strong points of the Adams view but ortant 
siderably different in a number of pu^ a the 
Ways, so that it will be consonant attempts 
experimenta] literature, The theory a se tha 
to deal with discrete tasks, that 15, m an 
ave a recognizable beginning ane 5 
are usually quite short in time ( of types 
than 5 sec in duration). A number dditio? 
of discrete tasks are considered. In » P 
to the linear positioning respons ime at 
which Adams dealt, the theory is or move 
rapid “ballistic” tasks with very pd Jn 
ment times (e.g., less than 200 wor ( 
that are “open” as well as “closed cy» * 

ton, 1957) tasks that demand ner n ete 
fasks that haye maximum speed, hele nuo 
as goals. Tracking and other prn he: 
tasks are deliberately excluded fr sim", 
article because or the rather complex inp" 
taneous interaction between anal d 
and motor output, although pap si 
Possibility that the ideas presented pa kin? 
be useful at some future time for 


y 
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(see Pew, 1974, for an up-to-date review of 
the tracking literature). 


Tue THEORY 


- In order to correct for the shortcomings 
existing open- and closed-loop theories in 
accounting for the recent motor learning and 
performance data, a somewhat marked de- 
parture from these original points of view 
eee As with any new theory, 
- E is probably not very much really new, 
Eus many of the ideas borrowed from 
din os joe of view. Such is the case here. 
cation theory is explained in the next few 
DIA ns, it will become obvious that the 
Pan ge of the major ideas can be traced to 
: artlett (1932) in terms of the notion of the 
Schema, to Adams (1971) for his application 
Brides theory to learning of motor 
En d Pew (1974) for the suggestions 
skills he application of the schema to motor 
ae to Lashley (1917) for his lead in 
Tenis erzmg man as controlling his move- 
D» Tu with "motor programs. It 
m " open-loop control ideas and the 
program that we turn next. 


The Motor Program 


his idea that the human being h f 
MD ored muscle commands ready for action 
» d time has probably been with us for 
eem: long time, but the first important 
Conf mentation that movement was centrally 
^ ens was provided by Lashley (1917) 
with escribing the movements of a patient 
of " à gunshot wound in the back. Because 
ao e wound, the patient had lost all sensa- 
a from his lower limbs, but had not lost 
S" efferent pathways that enabled him to 
n ME Even though he could not feel move- 
lent in his leg, he was nevertheless able to 
iion it with surprising accu 
ed L normal control subject. 
m Lashley to argue for a position 
ovement was controlled centrally, 
tere was little possibility that the wounded 
Patient could have been using feedback to 
Suide his movements. 

This idea has been restate 
Sth formally and informally, and with the 
Advent of the electronic computer the cen- 
Talist notion became couched in terms ° 


as a set 


racy, not un- 
This finding 
in which 
since 


d many times, 
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the programs used by the machines; Henry's 
(1960) “memory-drum theory" was such an 
idea, using the notion of the memory drum 
element in the earlier computers as an anal- 
ogy to the human system. One of the more 
recent statements is that of Keele (1968), 
who defined the motor program as a se- 
quence of stored commands that is "struc- 
tured before the movement begins and allows 
the entire sequence to be carried uninflu- 
enced by peripheral feedback" (p. 387). 
The primary evidence for the motor pro- 
gram notion has been that the processes 
involving the generation of sensory error 
information, perceiving it, and initiating cor- 
in response to those errors was 
requiring from 120-200 msec 
"reaction time") to initiate the 
While it is true that there is 
e in the speed with which the 
channels operate, with pro- 
prioception being fastest, at about 110 msec 
(Chernikoff & Taylor, 1952), and vision 
being the slowest, at about 190 msec (Keele 
& Posner, 1968), there is still the problem 
that many movements can be carried out in 
far less time than is required for the feed- 
back loop to operate. In fact, the strongest 
human evidence for the motor program no- 
tion seems to be that subjects can initiate, 
carry out, and stop a limb movement within 
100 msec, implying that decisions about 
when to stop the movement must have been 
made prior to the initiation of the movement. 
These results have provided a serious di- 
lemma for the closed-loop performance the- 
orists, and have provided the support needed 
for the program notion to become popular. 
The idea of the motor program is largely 
a default argument, as pointed out by Pew 
(1974). There is really no direct human 
evidence of à motor program; centralists 
n that there is no other known means 
of producing the movements ; thus programs 
must be the explanation. Actually, it should 
be shown either (a) that feedback is present 
in movement but is not used, or (b) that 
feedback is not present and movement can 
still occur. Strictly, neither of these two 
possibilities has been shown experimentally. 
There are, however, data from subhuman 
species that provide support for the program 


rections 
quite slow, 
(about one 
corrections. 
some varianc 
various sensory 
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notion. The most convincing is the report 
from Wilson (1961), who totally deaffer- 
ented the wing musculature and related 
joints of locusts. When a ganglion near the 
head was stimulated electrically, the locust 
would produce movements of the wings 
closely resembling flying (although | the 
movements were decreased in amplitude 
somewhat), and these movements would 
continue for an extended period of 
without further stimulation. 

pears that there are motor pr 
that they are capable of carrying out move- 
ment in the absence of Sensory feedback. 
Similar data have been collected by Notte- 
bohm (1970) with birdsongs. The limita- 
tion of these findings for human motor be- 
havior is that these subhuman motor pro- 
grams can probably be considered innate, 
and as yet there are no data Showing that 
learned acts can be programmed, 


time 
Thus it ap- 


ograms, and 


~ven so, the 
rect and in- 
lusion here, 
is modified 


notion seems to have Sufficient di 
direct support to Warrant its inc 


separate motor prog 
and such a postulati 
problem mentioneq 
therefore been ch 
that there are ge i 
for a given clas 
ample, there mi, 


Tam associat 
on brings 
earlier, 


a baseball, How 
ements governed 
be has been the 
as some would hold 

is quite narrow 


; ow 
tually is does not m 


ever, as long 


i © postulated th 
there is not à one-to-one Match between the 


program and each Specific movement that the 
individual can Produce, in order to avoid the 
storage problem. 
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These generalized motor programs we «i 
sumed to be able to present the prestrt ts 
tured commands for a number of ee 
if specific response specifications are uH 
vided. Thus a motor program for throw! i 
a ball could be modified by specific imd 
tions to throw fast or slow. These Met 
tions can be thought of as parameters = f 
can be varied before the movement «p a 
enable the execution of the pogram qe 
different speed, a different force, and Sos 
Thus, the performer's problem in cp alge 
a movement is the determination of ine the 
Sponse specifications that will modify 
existing stored motor programs. in the 

According to the current literature p^ o., 
neurological control of movement otor 
Granit, 1970), the output from the P he 
program consists of two sets of signals. xtra- 
alpha efferent signals innervate the es alee 
fusal fibers in the main body of the: ee in- 
ture, while the gamma efferent signa idée 
nervate the intrafusal fibers in the m (he 
Spindles, "Through the servo-action A ac- 
Muscle spindles, the alpha and Legit 
tivity is coordinated, so that minute iei 
tions from the intended ia iti ie 
pattern of movement are corrected very "t 
idly—within 30-50 msec—via the ope 

le-initiated feedback loops. These jur 
type Corrections contro] for unplanned fri 
tion in the fatigue state, unexpected." the 
occurring in the limbs, etc. so tha (i.e 
Movement takes on the intended path wa 
the path intended when the progran jake? 
chosen), Thus, strictly speaking, 1t ain 
little sense in view of the evidence yp) have 
de Keele (1968) and Schmidt (1972b 

One, that movements with duration? yer 
than 200 msec take place “without on s 
Ment from peripheral feedback,” becats¢ ] 


K 
H quic E 
Spindle system Operates much more 4 itio" 


than this; What is meant by this de i 
of the motor program is that when th 
gram is initiated, it carries itself m 
planned, correcting for deviations kr 
intended path of the movement, but 
Something happens in the environment row 
Tequires that some 


new movement not 2] 
goal) be planned, the performer cat 
complish any 


4 the P 
Such changes until gimat 
Sram has run its course for appro? 


| 
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2 5 
ndr ogm s aee e 
t he periphery cannot ini- 
tiate a new program until the present one 
has run its course for one reaction time (see 
Schmidt, in press). 

Actually, the time over which a program 
must run its course without change is prob- 
ably far longer than the lower limit of 200 
msec usually specified. If the movement 
time is only 200 msec, any signal that the 
Subsequent movement is about to be incom- 
patible with the environmental conditions 
must occur very early in the movement; 
being able to correct for errors occurring 
this early seems unlikely, because the initial 
portions of the movement must usually be 
made in order for the subject to perceive 
that it is going to be incorrect. Depending 
upon the type of response, it is reasonable to 
believe that the program could carry itself 
out for nearly 400 msec before it could be 
npe. The amount of possible feedback 
à vement in a movement longer than one 
reaction time is probably dependent primar- 
ily upon the movement time, and seems in- 
dependent of the movement velocity so long 
às the movement time is constant (Schmidt 
& Russell, 1972). 

A primary focus of the present theory is 
On the development of motor programs and 
their response specifications, an emphasis 
Shared by a number of motor skills research- 
ers, Perhaps this point is made best in a 
Passage from a recent article by MacNeilage 


and MacNeilage (1973) : 


pe need for peripheral sensor: X 
hought of as inversely proportional to the ability 
of the central nervous system to predictively deter- 
mine . , , every essential aspect of the following 


acts. (p, 424) 


Because of the lags in proce 
the subject becomes less and less dependent 
Upon feedback for performance, and the em- 
Phasis shifts from feedback-controlled, jerky 
Performances to the smooth execution of 
almost completely open-loop movements. 
Thus, the problem for the subject in learn- 
ing motor skills is to develop these open-loop 
Programs for his movements to free himself 


from feedback involvement. | Pew (1966) 
has shown, in a task requiring alternate 
finger-tapping movements to keep a dot cen- 


ory feedback can be 


ssing feedback, 


tered on a screen, that there was a shift in 
control from closed-loop to open-loop con- 
trol. Schmidt and McCabe (Note 2) also 
found less feedback involvement in a discrete 
timing task with a 750-msec movement time, 
using the index of preprogramming dis- 
cussed by Schmidt (1972b). 


The Schema 


The notion of the schema is not a new idea 
at all, as the first formal statement of the 
idea was made by Head in 1926, and the 
ideas were subsequently modified consider- 
ably and presented in a book by Bartlett 
called Remembering (1932). The idea, usu- 
ally stated with respect to perception, is that 
in order to perceive a set of visual stimuli 
(eg. a dog) and to classify these stimuli 
correctly in the category “dog,” we need not 
have previously received the particular set 
of stimuli in question. Through our past 
experiences with seeing dogs, we store these 
stimuli in recognition memory and also ab- 
stract these stimuli into a concept related to 
dogs for additional storage. This concept 
forms the basis of a schema or rule for deter- 
mining whether a new set of visual stimuli 
should be classified into the category "dog" 
Thus to recognize an animal as a 
ave ever seen that partic- 
ular animal before, and with the use of the 
schema for dogs, we correctly identify the 
animal's category. This rule for determining 
the category membership of a set of stimuli 
forms the basis for the definition of the 
schema, as defined by Evans (1967b): 
racteristic of some population of 
ts of a set of rules serving as 
ducing a population prototype 


or not. 
dog, we need not h 


A schema is a cha 
objects, and consis 
instructions for pro 
(the concept). (P. 87) 


Thus, given the schema for 


consists of a set of rules for 
a set of stimuli should be classed as a dog 


or not, the individual comes to the deci- 
sion about the category membership of the 


stimuli. 
While the 1 
in existence for 


“dog,” which 
determining if 


notion of the schema has been 
a long time, it has been 
limited in its usefulness theoretically because 


there have been a number of ways of think- 
ing about the same idea (e.g., Head, 1926, 
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versus Bartlett, 1932), the ideas were some- 
what too "mentalistic" for the behavioristic 
leanings of psychology until recently, and 
there have not been attempts to operation- 
alize the concept so that it could be tested 
experimentally. Recently, however, a num- 
ber of researchers have invoked the notion 
to explain nonsense pattern recognition, and 
Evans and his colleagues (Edmonds & 
Evans, 1966; Edmonds, Evans, & Mueller, 
1966; Edmonds & Mueller, 1966; Edmonds, 
Mueller, & Evans, 1966; Evans, 19672, 
1967b; Evans & Edmonds, 1966) and 
Posner and Keele (1968, 1970) have pro- 
vided strong operational definitions and tests 
of the concept. The descriptions of the 
Posner and Keele studies indicate how the 
Schema has been tested for pattern recogni- 
tion tasks. 

Posner and Keele (1968, Experiment IIT) 
presented a series of random dot patterns on 
a screen to subjects. They had three basic 
9-dot patterns (the "prototypes"), and they 
produced variations (called distortions") 
of these prototypes by randomly moving the 
dots in the original patterns, In trainin 
24 of the distortions were presented, and ui 
subjects learned with practice and knowl 
edge of results (KR) to categorize the E 
correctly (Edmonds, Mueller, & E m 
1966, have shown that KR is dt dens) 


for this categorization to Occur), The od. 
inal prototypes from which the distortions 
were created were not Shown. Follo a 
Benne, a transfer test was given esting 
RM 6 distorts Pes (not seen pe 
y ns shown in trainin 12 
distortions no n 
unrelated, cadena pios 


upon the degree 


win s T 
indicate that subjects can Sinealiy oe 
dot patterns that they have never seen or 


patterns of a i 
category, stored it, and were E 


able to use this 
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stored abstraction to recognize the prototype 
pattern when it was later presented. A sub- 
sequent study by Posner and Keele (1970) 
showed that when the transfer test was pro- 
vided 1 week after the original learning, the 
recognition of the prototype did not decrease 
over the retention interval, whereas the I6c- 
ognition of previously seen distortions of it 
did. This implies that the schema for the 
dot-pattern classes was retained nearly per- 
fectly, while the actual patterns seen pre- 
viously could not be stored as effectively. 
These data Clearly suggest that subjects 
store both the patterns seen and the schema 
(the abstraction) of the pattern (although 
at different strengths), and that the schema 
allows subjects to recognize the prototype 
pattern without ever having seen it prove 
ously. These data, in addition to the Evans 
experiments cited previously, provide an €X- 
perimental paradigm and rather strong eV 
dence for the schema concept, at least tor 
stimulus recognition, The next sections dea 
with the extension of the schema to the area 
of motor response production and motor T€ 
sponse Tecognition, respectively. 


The Motor Response Schema 


Bartlett (1932) discussed the notion a 
the schema as a means of solving the storag™ 
Problem for response production and es 
means of Senerating novel responses. 
WaS quite clear that some such mechan 
Must exist, as evidenced by the following * 


the 
ce make the [tennis] stroke depends upp tthe 
siang of certain new experiences, most O jences 
Visual, to other immediately preceding exper! 


e to my Posture, or to balance of postures 
the moment, (p. 201) 


EX es ion 
Then, after one takes in this informat 


en- 
20 the present state of the body and 
vironment, Bartlett Says, 


nism 


When I make 
act, produce 
never merely r 


rer! Ob 
the stroke I do not, as a nee I 
something absolutely ne 2) 
epeat something old. (p. 20 


H " cisely 
€ was not Very clear concerning pre 


how ithe schema would operate ane 2 i 
earning would be handled, but recently sie 
(1974) has provided some additional th 
ng about the schema, While the views 


t the , ike 
Pressed in this article are not exactly 
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Pe 

ue clearly had their origin in Pew's 
E sta of the schema. as an abstrac- 
[caion S Langit a, Sn yee 
production. Basal =e as alia 
E anen ? n the in a lual 
Sa nent that attempts to satisfy 
d pertes stores four things: (a) the 
E a itions, (b) the response specifi- 
ry d or the motor program, (c) the sen- 
23 piane of the response produced, 
eta the outcome of that movement. The 

section explains each of these con- 
Structs, 

Initial conditions. A number of writers 
(ee, Keele, 1968; Pew, 1974) have indi- 
P. nat in order for the subject to move 

ectively, he requires information about the 
d ed state of his muscular system and 
The RU EE in which he is to move. 
liters conditions, then, consist of the 
Cantus ion received from the various re- 
Dhibke P aig to the response, such as pro- 
of "e [ae Loo Don about the positions 
Mesi ce and body in space, as well as 
Give n auditory information about the 
BE. d the environment. After the move- 
Homer he initial conditions used to plan the 

nent are stored. 
oe specifications. 
ends: fe ter generation of th r 
bead. assumed to be rather general, with 
s : ns of the basic pattern possible by 
net ng such important elements as the 
rd with which it is run off, the forces 
nidi. etc., the subject must specify these 
off m before the movement can be, = 
T fter the movement, these spec! E 
that are stored along with the other a or- 
ie on received after the movement. qM 
beoe as a record of the specifications of the 

ment produced. 


Sensor 
inf ensory consequences. 
Ormation stored after the movement 15 


the response-produced sensory information. 
his information consists of the actual feed- 
ack stimuli received from the eyes, ears, 
Proprioceptors, etc. Thus, the sensory COn- 
Sequences are an exact copy © the afferent 
Information provided on the response. 
Response outcome. The fourth source of 
‘formation stored after the movement 15 the 


Since the motor 
e muscle com- 


The third type of 
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success of the response in relation to the out- 
come originally intended. The desired out- 
come (or goal) of the movement is poten- 
tially a verbalization, such as, “throw the 
dart very hard at the center of the target” 
(although the desired outcome need not ac- 
tually be verbalized), and the response out- 
come is in these same terms, such as, "you 
threw 23 mm to the left." Thus, the actual 
outcome of the movement is stored, not what 
was intended. This outcome information 
arises from information the subject receives 
after the movement, and consists of KR 
(when present) and subjective reinforce- 
ment that the subject obtains from other 
sources of feedback. The accuracy of the 
outcome information is thus a direct function 
of the amount and fidelity of the feedback 
information, and a subject without any feed- 
back information does not have outcome in- 
formation to store. 

Schema formation. The above four sources 
of information—initial conditions, response 
specifications, sensory consequences, and re- 
sponse outcome—are stored together after 
the movement is produced. When a number 
of such movements have been made, the sub- 
ject begins to abstract the information about 
the relationship among these four sources of 
information in a way suggested by the dot- 
pattern discrimination experiments. The 
schema notion requires some extension from 
the original pattern-perception idea, how- 
ever, in that in the motor case it is the re- 
lationship among the arrays of information 
that is abstracted rather than the commonal- 
ities among the elements of a single array. 
The strength of the relationship among the 
four stored elements increases with each suc- 
cessive movement of the same general type 
and increases with increased accuracy of 
feedback information from the response out- 
come. This relationship is the schema for 
the movement type under consideration and 
is more important to the subject than is any 
one of the stored instances, which, according 
to Posner and Keele (1970), are forgotten 


more quickly over t! the schema. 
Figure 2 diagrammatically shows how these 
sources of i ated to form 


the schemata. 
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teroceptive feedback consists of anticipated 
Pee — etc., of the environment and 
; and the objects moved by it. 
J eee prensa after the movement, each 
on are sensory consequences is 
s don h the respective inflow of sen- 
mation (the proprioceptive and 
exteroceptive feedback, respectively), and a 
resulting mismatch in the expected and ac- 
tual sensory consequences produces an error 
that is fed back to the schema, providing 
information (subjective reinforcement) as to 
the outcome of the response produced. These 
Processes are symbolized in Figure 3, in 
which the recall and recognition schemata 
are combined (ie. the “motor response 
schema”) to increase the clarity of presen- 
tation. 

Since the expected sensory consequences 
are dependent upon the developing relation- 
Ship between the response outcome (as de- 
termined by KR or other subjective rein- 
forcement) on previous trials and the sen- 
sory feedback actually received, the strength 
of the recognition memory should increase 
as a function of both KR in initial practice 
and the quality and amount of feedback re- 
ceived on each trial. This portion of the 
schema rule is the basis of recognition mem- 
ory for the movements of the type governed 
by the schema, This recognition memory 
is assumed to be analogous to recognition 
in verbal learning—including the 
assumption that it is independent of motor 
recall memory (i.e. that memory associated 
with the response-production portion of the 
schema), even if some of the variables (e.g., 
KR and subjective reinforcement) are sim- 
ilar for the two memory states. 

An important point should now be made 
about the nature of the expected sensory 
Consequences generated before the move- 
ment. In order for the subject to be able 
to receive information about the correctness 
of the movement in relation to the desired 


Outcome, he must be able to compare the 


actual feedback with the feedback expected 
nvironmental 


if the movement achieved the € 

goal, and thus the expected sensory conse- 
quences must represent the feedback conse- 
quences of this correct movement. This 1s 
in contrast to a number of earlier theories 


memory 


(e.g., Anokhin, 1969; Bernstein, 1967 ; Pew, 
1974; Sokolov, 1969) in which the expected 
sensory consequences are of the movement 
actually chosen; such expected sensory con- 
sequences can provide information about the 
extent to which that movement was carried 
out faithfully, but can provide no informa- 
tion about whether appropriate response 
specifications were selected. A major differ- 
ence between these theories and the present 
one concerns the way in which these ex- 
pected sensory consequences are chosen. 

In the present theory, the subject begins 
the process by selecting a desired outcome 
and by noting the initial conditions at the 
time. The relationship between actual out- 
comes and sensory consequences (the recog- 
nition schema) allows the generation of a 
set of expected sensory consequences that 
represent the best estimate of the sensory 
consequences of the correct movement. Also, 
the relationship between actual outcomes and 
response specifications (the recall schema) 
permits the generation of the response spec- 
ifications that are the best guess as to how 
to achieve the desired outcome. With this 
method of selection, even though the ex- 
pected sensory consequences and response 
specifications are strongly associated, there 
is no necessity that they be isomorphic, as 
with the earlier theories, because the recall 
and recognition schemata are separate. While 
the two schemata do share the initial condi- 
tions and actual outcomes as variables, they 
are separate because the recall schema is the 
relationship between these variables and re- 
sponse specifications, whereas the recogni- 
tion schema is the relationship between these 
two variables and sensory consequences. 
Further, it is possible that the subject may 
choose inappropriate response specifications 
al n, recognize his error, update his 
produce different specifi- 
cations on trial n + 1, even if the initial con- 
ditions are identical; even though two sets 
of response specifications were generated on 
the two trials, the expected sensory conse- 
quences could have remained constant be- 
cause the desired outcome did. 

In the earlier theories, however, the sub- 
ject first chose the particular movement pat- 
tern and then chose the expected sensory 
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ison of expected and actual consequences, 
and (b) this raw signal after it has been 
labeled and converted into a reportable form. 
This notion is certainly not new, as Bartlett 
in 1932 said, 


Pregun processes, in fact, involve two different, 
but related functions: (a) that of the sensory pat- 
tern, which provides a physiological basis for per- 
ceiving; and (b) that of another factor which con- 


structs the sensory pattern into something having 
a significance which goes beyond its immediate 
sensory character. (p. 188) 


While this raw signal can be used by the 
subject, most notably in those situations 
where the subject makes a series of adjust- 
ments in order to reduce his error signal to 
zero in positioning tasks, it is the labeled 
error signal that plays a role in explaining 
learning. The labeled error signal is termed 
subjective reinforcement and can serve as a 
Substitute for knowledge of results (KR), 
providing outcome information that can up- 
date the recall schema. Subjective rein- 
forcement is less accurate than is the perfect 
KR, and so the subject will probably use 
KR to update the recall schema when KR 
is present, but when KR is not present, the 
subject can resort to the less accurate sub- 
jective reinforcement for his outcome infor- 
mation. 

The error labeling system is proposed to 
be another schema, in this case a schema for 
labeling sensory signals. It is assumed that 
past sensory signals have been stored along 
with the actual sensory consequences (based 
on KR), and a schema rule is built up over 
time that relates the KR received to the 
signals received. When this schema is well 
established, it enables the subject to receive 
error signals from the expected-actual com- 
parisons, and to attach a label to these sig- 
nals according to the KR-sensory-signal re- 
lationship. Thus, in later practice, the sub- 
ject can attach a label to a new sensory error 
signal that he has not experienced previ- 
ously, and the result is subjective reinforce- 


ment. Of course, the essential ingredient for 
the error labeling system is KR, and without 
e schema for 


it subjects cannot deve 
labeling new errors. 


lop th 
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Producing a Movement 


Given the notions presented in the pre- 
vious sections, it will be useful to describe 
how the schema operates in producing a 
motor response by explaining the steps in 
the movement in the order that they are 
thought to occur. Figure 3 shows the move- 
ment process, beginning with the specifica- 
tion of a desired outcome and subsequent 
determination of the initial conditions. From 
the relationship established in past responses 
between outcomes and response specifica- 
tions, new specifications for the motor pro- 
gram are selected. Also, at the same time, 
the relationship between outcome and sen- 
sory consequences allows the selection of the 
expected proprioceptive feedback and ex- 
teroceptive feedback. When these processes 
have occurred, the movement can be initi- 
ated by running off the motor program. 

Immediately following the initiation of the 
motor program, the impulses begin to flow 
out to the muscular system with all of the 
details of at least the first 200 msec of the 
movement specified. As the movement 1S 
carried out, sensory receptors in the body 
provide information about the movements 
occurring. For convenience only, these re- 
ceptors are classed into two groups—Ppro- 
prioceptive feedback and exteroceptive feed- 
back—although it should be recognized that 
this classification does not represent any 
meaningful differences in the way these feed- 
back sources are assumed to operate. The 
feedback information is fed back to the ex- 
pected proprioceptive and exteroceptive feed- 
back states, respectively, and the discrepancy 
between these anticipated and actual states 
represents an error in responding. The raw 
error is fed back to the schema so that in 
positioning movements, for example, addi- 
tional adjustments can be made to reduce the 
error to zero. This error is also fed to the 
error labeling system, where the subject as- 
signs a reportable label to it, and this result- 
ing subjective reinforcement is then fed back 
to the schema as subjective information. 

The final source of error is the informa- 
tion that the experimenter Or teacher pro- 
vides for the subject after the movement, 
termed KR. It arises from the measured 
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outcome of the response ("the ball missed 
the target by 20 cm to the left”), and is 
already in a form that is interpretable as a 
deviation from the desired outcome. The 
KR information, in addition to being fed 
back to the schema for updating the schema 
rules, is fed to the error labeling system to 
enable this system to improve its accuracy 
in labeling future error signals arising from 
the deviations of proprioceptive and extero- 
ceptive feedback from their respective antic- 
ipated states. 

Learning is possible by feeding back the 
essential error information to the schemata. 
The response specifications and initial con- 
ditions are stored when the movement is 
selected, and the actual proprioceptive and 
exteroceptive feedback are stored as the 
movement is progressing and as these sources 
of information are generated. Finally, the 
actual outcome is stored, based on KR when 
it is present, but based on subjective rein- 
forcement if KR is not present. These 
sources of information can then be used to 
update the schema rules and provide revised 
estimates of the expected sensory conse- 
quences and response specifications on the 
next trial. The traces Tepresenting these 
sources of information are hypothesized to 
be relatively weak, so that rapid forgetting 
of them can be expected over time. How- 
ever, the traces are assumed to be held in 
Store sufficiently long so that they can be 
used to update the appropriate schemata, 
The schemata, on the other hand, are as- 
sumed to be stored far more permanently, 


although some forgetting of them over time 
could be expected, 
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The present theory extends the notions of 
Adams’ (1971) theory to Tesponse types 
other than linear positioning, and can handle 
the learning and performance of discrete 
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tasks that are either open or closed (Poul- 
ton, 1957) and either rapid or slow. In 
addition, a type of discrete response not fre- 
quently discussed is one in which the per- 
former continually strives to do “more of 
something,” such as moving faster, jumping 
higher, or producing more force, and this 
type of task does not have accuracy, in the 
traditional sense at least, as a goal. Fur- 
ther, the theory has the capability of explain- 
ing novel movements that may not have been 
produced previously. " 

Open versus closed skills. Poulton (1957) 
defined open skills as those in which envi- 
ronmental and situational characteristics can 
change as the subject plans or performs his 
response, such as the response that would be 
required of a wrestler attempting a take- 
down. Closed skills, at the other extreme, 
are those in which the goal and environmen- 
tal conditions are relatively constant. The 
important distinction between these two 
types of movement was recognized by Gen- 
tile (1972). Essentially, with closed skills 
the performer need only learn one movement 
that satisfies the goal, whereas with open 
skills, the environmental conditions are 
never constant, and the subject must plan 
his response to meet the anticipated situa- 
tional demands of the enviroment. The 
question of the novel movement is partic- 
ularly important for these open skills, since 
performers must make n 
fore made if the environment is not exactly 
the same as for previous movements. 


_ Theoretically, with closed skills the sub- 
Ject dev 


elops the schema rule so that = 
makes finer and finer predictions on t? 


novements never be- 


basis of the tule, and the expected sensor? 
consequences are more and more accurate 
estimates of the consequences of that one 
movement, This is much the same as Adam 
has it (but the mechanisms are different. 9" 
course). The schema rule produces incre? 
ingly accurate estimates of response spe d 
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aga outcomes and response specifica- 
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Rapid versus slow responses: The pres- 
ent theory extends Adams’ (1971) theory 
dealing with slow, graded, linear positioning 
Tesponses to movements that are too rapid to 
allow feedback to be used while the move- 
Ment is in progress. T heoretically, these 
two types of responses are controlled and 


-€arned in very different ways. 
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In slow positioning movements, the schema 
rule determines the expected sensory conse- 
quences for the end of the movement speci- 
fied by the desired outcome. Then the sub- 
ject moves along the track and compares the 
response-produced feedback with the ex- 
pected sensory consequences. He continues 
moving as long as his errors (the discrep- 
ancies between feedback and the expected 
consequences) are being reduced, and then 
finally homes in on the target location by 
reducing his error to zero. Even though the 
subject is moving actively and some (eg. 
Marshall, 1972) have termed these Bo 
ments "recall," the present position is that 
the subject is using feedback compared with 
a recognition state, and hence the subject is 
moving to that location that he recognises as 
being correct. Such slow responses are de- 
pendent on recognition memory and the rec- 
ognition portion of the schema rule. 

What about error detection and the deter- 
mination of subjective reinforcement after 
the movement? Clearly, since the subject 
has used the comparison between expected 
sensory consequences and response-produced 
feedback as the basis for determining the 
endpoint, he will have subjective reinforce- 
ment that is zero, reflecting the fact that he 
moved to that position for which the error 
signal was zero. He can, therefore, have no 
further estimate of his own error with re- 
spect to the target on that trial unless he 
receives KR. He cannot, therefore, continue 
to learn these movements without KR, be- 
cause he requires KR and/or subjective re- 
inforcement in order to strengthen the re- 
sponse-recognition schema. If the subject 
Jacks this information, the schema remains 
stable, and the subject maintains his per- 
formance. If the level of practice js low, and 
the schema is not well developed, he may 
regress in performance after KR withdrawal 


because of inappropriate guesses about out- 
h the sensory conse- 


come being paired witl 
accuracy of 


quences, to the detriment of the 
the schema rule. 
Now let us con 
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in which the subject learns to move 30 cm 
in exactly 150 msec) are completed before 
the subjective reinforcement can begin to 
have an effect. Even if it cannot act on that 
trial, however, the subjective reinforcement 
is present nevertheless, and still signals the 
extent to which the expected Sensory conse- 
quences and response-produced feedback 
matched, and hence the Subject has an esti- 
mate of his Tesponse correctness after the 
movement has been completed, This is fed 
back to the schema as actual outcome, and 
the rule between outcome and Tesponse spec- 
ifications is strengthened. If KR js with- 
drawn, the subject has information about 
Tesponse correctness from subjective rein. 
forcement based on Proprioceptive and ex- 
teroceptive feedback, and Performance can 
be maintained. If sufficient KR practice 


Many tasks 


x empt 
more of Something, such as throw ii 


jump higher, 


resentation 
feedback States, 


the Perceptual trace can 


past Consequences. 
what better estimate 
the subject achieved 
outside of the r 
executed, 


Two C, ompartments o 


While there ar 
Postulated, they sh 


[M emory 


€ two states of memor 
ould not be seen as totally 


^ 


Ricwarp A. Scuarpr 


independent, for they are hypothesized to de- 
velop using some of the same variables. The 
recall schema depends on the actual outcome 
(KR or subjective reinforcement), the ini- 
tial conditions, and the response Specie 
tions, with the rule updated after each tria 
by an integration oi this new information 
into the existing schema, By contrast Te 
recognition schema develops using the - 
conditions, the Sensory consequences, an 
actual outcome integrated into the Le 
recognition schema, Thus, while some © 
the variables are different, both schemata 
are clearly dependent on the actual outcome 
and the initial conditions, and both che 
will develop according to the experience 5i 
individual has had with these variables. Ke 
the other hand, they are separate in that t a 
recall schema uses response specification 
and the Tecognition schema uses e m 
sensory consequences, The degradation 5 
feedback when KR is present should have n 
effect on the recall schema, since KR is a 
as the estimate of actual outcome, but shot 
have detrimental effects on the recogniti? 
Schema, 1) 
As was pointed out earlier, Adams' dae 
theory has difficulty in explaining how v 
could continue to learn a positioning how 
after KR been withdrawn, or even ance 
Simply maintain perform the 
1 The problem was that i tua 
subject ever made an error, the percep he 
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rements in performance might occur because 
of incorrect guesses about the actual out- 
come, with inappropriate updating of the 
Schema resulting in the performance decre- 
ments commonly seen (e.g., Bilodeau, Bilo- 
deau, & Schumsky, 1959). 

In rapid movements, however, the KR- 
withdrawal effects are somewhat different. 
W hen enough practice with KR has been 
provided so that the subject has an accurate 
recognition schema, if KR is withdrawn the 
subject has the capability to store all that is 
necessary for continued recall schema devel- 
Opment: the initial conditions, actual out- 
come (from subjective reinforcement), and 
response specifications. Hence, if the sub- 
ject makes an error, it does not degrade 
the recall schema mechanism, but rather 
strengthens it, since in an error response 
actual outcome and response specifications 
will be lawfully related in the same way that 
they are in more nearly correct movements. 
The result is that the recall schema is at 
least maintained in strength, and is perhaps 
increased in strength, with KR withdrawal. 
Of course, the recognition schema remains 
stable because it requires the pairing of ac- 
tual outcome and sensory consequences; in 
this case the actual outcome and sensory 
consequences are redundant, since the for- 
mer was computed on the basis of the latter, 
and no updating can occur. 

Some SUPPORTING EVIDENCE 


This section provides some experimental 
evidence that is taken as support for the 
various hypothetical processes assumed to 
occur in the performance and learning of 
discrete tasks. In addition, independent var- 
iables that are hypothesized to influence the 
various processes are mentioned, and experi- 
mental paradigms for subsequent tests of 
these ideas are suggested. Before turning 
to the evidence, however, à discussion is 
nécessary concerning the dependent varia- 
bles assumed to be indicative of the pro- 


cesses and states proposed. 


Dependent Variables 
Recently there has been 
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performance in tasks demanding accuracy. 
While the issue is certainly not new (eg., 
Woodworth, 1938), Schutz and Roy (1973) 
have reopened the argument, advocating the 
use of constant error and variable error as 
measures of accuracy, arguing that absolute 
error should not be used because it is con- 
founded with constant error and variable 
error. It is appealing to have measures that 
are independent within subjects such as con- 
stant error and variable error, allowing var- 
iable error to represent consistency and con- 
stant error to represent directional biases, 
but it can be argued that absolute error is 
to be preferred as a measure of performance. 
First, having two dependent measures in one 
experiment can allow opposite conclusions 
to be drawn about the processes that they 
are thought to estimate (e.g, Dobbins & 
Rarick, 1975), leading to equivocal findings. 
Second, absolute error has a solid history of 
use in psychology and motor behavior, prob- 
ably because it is intuitively meaningful; it 
is the amount by which the subject was in- 
correct in his movement. Third, one can 
imagine the performance of an individual 
who is told that he will be financially re- 
warded for reducing his constant errors on 
a series of trials; he will probably move on 
trial » to cancel out a constant error made 
on trial » — 1, and will not attempt to 
achieve a correct movement. It seems that 
individuals try to reduce the amount by 
which they are incorrect in the task (ie. 
the absolute error), and measuring constant 
error and variable error when the subjects 
are not aware of it could provide misleading 
data. For these reasons, the use of absolute 
error as a measure of recall-schema strength 
seems justifiable. . } 
In some of the experiments cited in this 
variable error is used as a substitute 
for absolute error, and it can be argued from 
Schutz and Roy’s (1973) analysis that the 
variable error and absolute error are prob- 


ably measures of the same state or process, 
e constant errors in such tasks are 


nearly zero. Finally, Henry (1974) advo- 
cates the use of the total variability of the 
subject's scores around the correct value as 
a measure of overall accuracy in responding, 
and this measure is probably highly associ- 
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ated with both absolute error and variable 
error under conditions where constant error 
is nearly zero. 

Motor recognition. A number of research- 
ers (e.g., Newell, 1974; Newell & Chew, 
1974) have argued that the appropriate sta- 
tistic for the estimation of recognition mem- 
ory strength is the absolute value of the 
difference between the objective and subjec- 
tive error, and that the within-subject cor- 
relation does not indicate the extent to which 
the subject is incorrect in his judgments, 
only that he is sensitive to the direction of 
his errors. They argue that he could be in 
error by a constant of 100 msec (ie. his 
average guess might be 100 msec too large) 
and yet this fact would not be represented 
by a drop in the correlation. While they 
are correct on this point, there are a number 
of reasons that the correlation is to be pre- 
ferred to the difference score. 

First, there has been negative reaction to 
difference scores (eg. Bereiter, 1963; 
Schmidt, 1972a), and the objective-subjec- 
tive difference score is not free from unde- 
sirable effects, which include (a) an extreme 
correlation with the minue 
case is the absolute error (r = .80 in Schmidt 
& White, 1972), (b) 
of relationship with t 

case, the recognition-sc 
the difference score a 
and (c) statistical unreli 
summation of errors j 
two components, 
present theory is t 


n measurement for the 
A further difficulty for the 


he fact that the objecti 
sent r a Jective- 
subjective difference and the absolute error 


theoretically are measures of Separate state 
of memory ; however, the method of i 
putation of these two Statistics allows the 
inclusion of absolute error in both measures 
leading to a strong correlation between these 
two scores, even to the point that the plots 
of these two scores over trials yields near] 
identically shaped functions. In such cree 
it would be possible to show that a given 
experimental variable influences both the 
recognition and recall 


r measure identically 
because of a statistical artifact, leading to 


the erroneous conclusion that recall and rec- 
ognition were not separate, 
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The objective-subjective correlation is to 
be preferred, because it is clearly Seriea 
to the direction of the subject's guesses. 
also counters a strategy employed by some 
subjects whereby they guess the correct 
score (e.g., 200) on each trial of a 200-msec 
task. In such cases the correlation is v 
reflecting a total insensitivity to the size an 
direction of the error, whereas the difference 
score is minimized (especially if the constant 
error is zero for that subject), making him 
appear to be quite talented in predicting his 
score. As a measure of the strength of the 
recognition schema, therefore, the carro 
tion statistic is to be preferred, but could | & 
supplemented with the average objaaive~ 
subjective difference to be certain that irs 
were not large constant errors in guessing. 


Support for the Schema Notion 


The most important portion of the present 
theory is the schema, and this section dinde 
with experimental and other evidence for it- 
The recall and recognition schemata will be 
considered Separately, n 

Recall schema. Strictly speaking, va 
little published work supporting a schema 
for response Production can be found. How- 
ever, there are a number of studies sugges 
tive of the schema’s existence, and there ' 
Subjective-anecdotal evidence as well. at 

Anecdotal evidence for the schema is ont 
Subjects can Produce sequences of eer 
ments that they have never performed : 
fore. A classic example concerns hand WT! 
Ing, in which a person's signature can 
a cognized as his regardless of the size c 
the actual marks produced ; one can produ 
the same Signature on a check or 10 ti™ 
larger on a blackboard ( Merton, 1972, p. 

course, quite different musculature 


f t ove 
Used in these two signatures, with the P? ps 


“hap? 
ments confined to the fingers and per! mi 
the hand in sm 


t all writing, and the entire p* 
im larger Writing. The schema notion are 
Plains this by saying that the movement? nat 
all run off by a large motor program rde! 
needs to haye certain specifications 1" pai 


to produce a given movement Sed" e- 


E nove 
hen the Specifications are that the jante 
ment should be small, rapid, and § 


| 
| 
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Phy the program can carry out these 
ments as planned. Of course, a motor 
ri theory such as that of Henry 
K A cao handle these findings, since 
Tesut vw that the movements are the 
different a specific motor program, with a 
uU program needed for each movement 
style.” The fact that individuals can do 
this seems to be evidence directly in conflict 
With such one-to-one program — movement- 
Style ideas. 
ies (1974) adheres to the idea that the 
ment is programmed rather generally, 

and then the movement parameters are spec- 
ified before the movement is initiated. In a 
Study by Armstrong (Note 3) a sequence 
of lever movements was learned in a number 
of sessions. An important result was that 
the timing in the sequence appeared to be 
ale die in that when the movement was 
eed too rapidly. the entire sequence was 
shi ded up, keeping the temporal relation- 

ps among the various submovements ap- 
proximately constant. The implication that 
Pew (1974) draws from these data is that 
timing is one of the response specifications 
that serves as a "parameter" of the motor 
Program.? Also, Glencross (1973) found in 
handwheel cranking that the movements 
Were very similar in terms of the timing of 
the onsets of force application even when 
resistance was added and the radius was 
changed; he labeled the phenomenon “ora- 
dation of effort.” It is as if there was a 
program for cranking, and that changing the 
quality of cranking (ie., less force, more 
Speed, etc.) provided a situation in which 
the subject still used the program but with 
a different set of specifications. While the 
present article does not attempt to deal with 
Such continuous responses, these findings at 
least suggest that a schema-like process 
could exist in discrete movements as well. 

However, the most impressive kind of 
evidence that could be generated in support 
of the schema is that subjects can produce 
movements of a given class that they had, 
Strictly speaking, never performed prev 
a 

? See also Brooks (1974 


appeared to be a parame 
movements in monkeys. 


), who found that timing 
ter of alternating hand 


ously; an example is the basketball player 
who shoots from various places on the floor 
with great accuracy. The notion is that the 
varied previous shooting experiences led to 
increased schema strength, providing a basis 
for generating novel movements of that same 
class. One important prediction is that in- 
creasing either the amount or the variability 
of such previous experiences lead to in- 
creased schema strength. These predictions 
suggest a test of the schema notion in terms 
of transfer of learning. 

Experiments testing this idea have been 
conducted, and some of them are quite old 
(e.g., Crafts, cited in Ellis, 1965; Duncan, 
1958). For example, Duncan used a task 
in which the subject had to position a lever 
into one of 13 slots depending upon which 
of 13 lights was illuminated, with the move- 
ments being done as quickly as possible. 
The task could be varied to produce 12 sim- 
ilar versions, and subjects received either 1, 
2, 5, or 10 variations in a training session, 
with the total number of trials held con- 
stant. Duncan found that the amount of 
transfer to two novel variations of the task 
was a positive function of both the amount 
and variability of training, especially as the 
variability was increased from one to two 
tasks. While this study supports the pre- 
dictions from the theory strongly, it coul 
be argued that what was learned and trans- 
ferred were cognitive-conceptual relation- 
ships related to which light went with which 
slot, and the findings may not tell a great 
deal about the existence of schemata for 
producing movement. To support the recall 
schema idea, a task should be used that in- 
volves primarily motor learning, with min- 
imized cognitive aspects, So that transfer as 
a function of response variability could be 
attributed to the development of schemata 
that define the response specifications. 

Schmidt and Shapiro (unpublished study, 
1974) used Duncan's design with a task that 
could be considered more motor in nature 
than his light-slot task. The task involved 
moving the preferred arm and hand to knock 
over four small barriers in a predefined 
order, and minimized movement time was 
the subject’s goal. The task was varied by 
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changing the locations (but not the orders) 
of the barriers, so that four tasks with 
slightly different movement-segment lengths 
and intersegment angles were defined. One 
group performed three tasks (40 trials of 
each with KR) while a second group per- 
formed one task (120 trials with KR). 
When subjects were transferred to a fourth 
novel version of the task, there was some 
tendency for the group with high variability 
(three tasks) to perform more quickly than 
the group with low variability (one task), 
with these differences increasing somewhat 
Over subsequent practice with KR, but the 
differences were small and did not reach 
statistical significance. 

The failure of the Schmidt and Shapiro 
study to support the schema notion does not, 
of course, disprove it, as there are a number 
of alternative explanations. First, the task 
involved the preferred arm and hand with 
college-age subjects, and it is possible that 
Schemata for movement production had al- 
ready been developed throughout extensive 
previous movement experience with that 
limb. This suggests that future work along 
these lines should use limbs and movements 
that have not been used extensively, or 
should use younger subjects in whom the 
schemata have not been developed. Also, it 
is possible that the variations of the task 
did not produce movements that were suffi- 
ciently different, or produced movements 
that were different in the "wrong" ways. 

. Recognition Schema. As was mentioned 
evidence n the dospttmy re s erae 
Tes for the ML. y 7 recognition stud- 
dence indicates tl at subject ih, This eyi 

cates that subjects not only 
Store the individual patterns seen, 


can store some abstraction of the 
This abstraction is th 


of a set of rules for 
given new pattern is or js not a mem 
the class of objects desc 
rule. As stron € seems t 
be, there are two essential Ways in which the 
experimental situation dealing with dot x 
terns differs from the motor Besoin 
schema in the present theory, * 
First, the schem 
represents the cen 


can 
but also 


a in the dot-pattern work 
tral tendency of the past 
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sets of stimuli seen. This is different from 
the present situation in that the recognition 
memory for, say, a novel ball throw may not 
represent the central tendency of all balls 
previously thrown. To have it represent the 
central tendency would provide a recognition 
state that would not be associated with a 
new response, that might not be near the 
central movement pattern, and that would 
not provide recognition for novel move- 
ments. Rather, what the schema represents 
is the relationship among a number of 
sources of information stored on each trial, 
and in the motor recognition case, these are 
the actual outcome and the actual sensory 
consequences. Hence, the schema notion 15 
extended from becoming the central tend- 
ency of a set of stimuli to the relationship 
between paired members of two sets of 
stimuli. 

A second and less cautious extension of 
the recognition schema notion is to types. of 
stimuli other than the visual patterns studie 
by Posner and Keele (1968, 1970) and 
others, and it is assumed that such abstract- 
ms Processes can occur with auditory, tac 
tile, and kinesthetic stimuli as well, There 
1S some support for recognition schemata 
with kinesthetic stimuli in a report by Wil- 
liams and Rodney (in press), As men- 
tioned earlier, blindfolded subjects attempted 
to learn a criterion position along a track- 
Way m two different ways. Group TO (tat 
Set only) moved 16 times to a stop at the 
criterion position, whereas Group IR (inter 
polated random) moved to 16 randomly Or 
dered Stops that surrounded the criterio? 
Position; Group IR was instructed that the 


ee, ou e 
criterion position was in the center of jw 
6 ps Presented. The critical trans 
test 


see Figure 1) showed Groups TO - 
performing with nearly equal absolu 
errors on the first block, but Group 
maintained performance over subseqe 


blocks, whereas Group TO regressed sharP 
in its performance, 


nt 


M catio? 

"p à recognition schema for the loca ilit 
along the trackway; the reduced varia 
for Group TO 


a 
3 à : hen? 
did not permit this scl 


ethan Then, on the transfer trials, 
roup IR subjects could generate the ex- 
ges ume consequences associated with 
Bor Me on position (even though they had 
End vu grita this position previously); 
À u then move so as to match the in- 
g proprioceptive feedback with the ex- 
pP proprioceptive feedback. The recog- 
RR ea strength was approximately 
IR en over the five blocks for Group 
A d for Group TO the traces associated 
a he criterion position apparently faded, 
gem mg in less accurate responding for 
Grou n 'The important point 1s that 
hn p R subjects could recognize the cri- 
it ay position without having experienced 
previously, and they could recognize it as 
effectively as Group TO subjects with ex- 
Derience at the criterion location. While the 
s peine does provide support for the 
3» > theory, it is perhaps more important 
to eg it provides strong contrary evidence 
iy ne prediction from Adams’ (1971) the- 
i which says that Group TO with prac- 
at the criterion position should have 
ona accuracy (due to increased percep- 
al trace strength) and should maintain 
accuracy longer than Group IR. These find- 
ings provide considerable difficulty for 


dams' position. 


Separate States of Memory 

There is considerable evidence in the 
verbal learning literature to indicate that 
there are two separate states of memory, one 
for recall in which subjects must produce 
the response term, and one for recognition 
in which subjects must only say whether the 
item has been seen before. (See Under- 
Wood, 1972, for a review of this issue.) The 
Same view is presented in this article and is 
in general agreement with Adams (1971). 
Under this view, recall memory is the state 
that produces the movement, and recognition 
Memory is the state that determines whether 
the subject can recognize that the movement 
Was correct. While there is stron, 
for this view in verbal Jearning, W ae 
evidence for this contention in motor skills! 

A research strategy: Tt was realized long 
ago in verbal behavior that recognition mea 
Sures (in terms of number correct) were 
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usually larger than were recall measures, but 
that this difference, in and of itself, did not 
necessarily mean that the processes under- 
lying recognition and recall were different. 
If it could be shown that a given experi- 
mental variable influenced recognition in one 
way (eg. increased it) and influenced re- 
call in another way (e.g. decreased it), this 
would provide evidence that the two memory 
states follow different laws. If two systems 
have different laws, it makes little sense to 
maintain that they are the same system, and 
finding that some experimental variables 
(e.g. intentional versus incidental instruc- 
tions) affect recall but not recognition has 
been used as support that recall and recog- 
nition are separate systems. 

Motor recognition and recall. The recog- 
nition-recall issue had not been studied in 
motor skills until Adams (1971) proposed 
the independence of these states in his the- 
ory. Since the introduction of this theory, 
however, there have been a few attempts to 
show that certain variables influence recall 
and recognition differently in motor skills. 
One such attempt was made by Schmidt 
et al. (in press), who sought evidence for 
the independence of recall and recognition 
using KR delay as the main independent 
variable. In Experiment I, subjects learned 
to recognize lifted weights, a performance 
reasoned to be based on recognition memory. 
Increased KR delay (25 sec vs. sec) 
caused a drop in the correlation between the 
actual weight and the subject's guess from 
75 to 46, supporting the conclusion that 
KR delay is a variable in recognition mem- 


ory. In Experiment II, the authors used a 
Schmidt & 


rapid ballistic task (as with 
White, 1972), attempting to show that KR 
delay was a factor in the recognition mem- 
ory for the task (as it was with Experiment 
I) but was not a factor in 
The task in Experiment TI involved a par- 
adigm wherein the subject was asked to 
learn to move a horizontal slide 30 cm (with 
follow-through permitted) in 200 msec. The 
subject made a response, the experimenter 
asked for his guess about his error (subjec- 
tive error), and then the experimenter pro- 
vided KR in terms of the actual (objective) 
error. The experimental variable was agam 


the recall memory. 
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TABLE 1 


RECALL AND RECOGNITION MEASURES AS A 
Function or DELAY OF KNOWLEDGE OF 
RESULTS IN ACQUISITION 


Delay interval 


Measure 5 sec 25 sec 
Variable error 18.6 msec 20.4 msec 
Constant errors +14.4 msec +16.3 msec 
Objective-subjective 

correlation 92 61 
N 20 20 


Note. Fi “Some Evidence for the Independence of Recall 

and WRecoguition in Motor Behavior" by R. A. Schmidt, R. 

Christenson, and P. Rogers, Journal of Motor Behavior, in press, 

Copyright 1975 by Journal Publishing Affiliates, Reprinted by 
ermission. 

n Positive values are overshoots. 


the delay of KR. The rationale was that if 
KR is delayed and the proprioceptive feed- 
back received from the movement fades with 
time, the actual consequences (based on 
KR) cannot be paired effectively with the 
faded sensory consequences, and hence the 
recognition schema will not develop effi- 
ciently. Recall, or the Production of the 
movement, is dependent upon KR and the 
response specifications (which, it is assumed 
do not fade with time as rapidly as the sen. 
sory consequences), and hence the theory 
pe predict E the measure of response 
roduction would not i 
ey SES be influenced by the 


with KR 
5 sec) or 
additional trials 
of recall in this 
terms of the 200 
stant error or y 
indicant here, erns the ex. 


ength of the 
ut on the no- 
of the inde. 
Was equated 
h the estab- 
ng the learn- 
of various ex- 
hmidt, 1975). 
d the indepen. 
on when it was 


across groups, in keeping wit 
lished paradigms for determini 
ing versus performance status 
perimental variables (e.g., Sc 

The experiment supporte 
dence of recall and recogniti 
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found that increasing KR delay had no sig- 
nificant effect on recall measures (constant 
error or variable error), but had es 
detrimental effect on the recognition indi- 
cant, the objective-subjective px iy 
As can be seen in Table 1, the 5-sec delay 
A à . ions of .92, whereas 
group displayed correlations o JF tion 
the 25-sec delay group showed a tor a 
of only .61, representing a significant d 
in the error-detecting sensitivity of the long 
delay group. Therefore, KR decay app” in 
to be a variable that causes decrements " 
recognition, but leaves recall unaffected, e 
these findings are in the direction predic 
by the theory, sable in 
If it is true that KR delay is a variable © 
the development of the recognition SC e on 
then the learning of any task dependen {as 
the recognition schema should be retarc e ni 
well. One such task is linear pastor et 
Which, according to the theory, is pem 
on the match between the expected sens ni- 
consequences (generated from the mh 
tion schema) and the incoming proprio 
tive feedback during the movement. mple 
difficulty with all this is that there is 2 
evidence that linear positioning tasks a 
affected by KR delay (see Hilden L^. 
or à review), One possibility 1s tha 
evidence on linear positioning has Bar e 
cedures that prevented the matching 9 ences 
back and the expected sensory e acet 
ecause of the time constraints usually P ‘that 
"pon subjects, and one could -—— 
Subjects may have switched to a St^ T, 


: ts. 
Wherein they program the movemen” “ement 
this c nd 


e 


] 


"T 
ould be dependent upon recall memory gn 


i punt 


: jl hac 
Starting location; but KR delay adi à nas 
emect on learning in this task. New oe 
Pointed out that it is possible that I m pu 
does not affect the recognition sche 

UK ee 

* The author wishes to thank Karl M- 


Providing this suggestion in a person 
communication, October 1974. 


fof 
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rather affects the development of the error- 
ee schema ; if the subject could not 
el the errors effectively in the Schmidt 
et al. (in press) study, we may have incor- 
rectly come to the conclusion that the recog- 
nition schema was impaired. More work 
will be needed to resolve this question. 
Even if the Schmidt et al. (in press) study 
may be inconclusive concerning the recogni- 
tion-recall distinction, there are other data 
that provide somewhat stronger evidence. 
The same basic strategy was used by Newell 
and Chew (1974) and Schmidt and Wris- 
berg (1973), but here the independent var- 
iable was the amount of response-produced 
feedback. In both studies, the presence of 
combined vision and audition was varied in 
the rapid, ballistic task (Schmidt & White, 
197 2), and they showed that vision and au- 
dition had no effect on response production 
in terms of absolute error. But in Schmidt 
and Wrisberg (1973), when KR was with- 
drawn, subjects with vision and audition 
regressed far less in performance than did 
subjects without these feedback sources 
present; the interpretation was that vision 
and audition enabled subjects to generate 
accurate subjective reinforcement via the 
comparison of the feedback with the ex- 
pected sensory consequences from the rec- 
ognition schema. Also, Newell and Chew 
(1974) showed that eliminating vision and 
audition in à transfer test increased the dif- 
he objective and subjective 


ference between t 
errors, a measure taken to be sensitive to the 
strength of the recognition schema. — 

The conclusion from this section 1S that 
variables have been isolated—the quality of 
feedback and perhaps KR delay—that m- 
fluence the recognition memory for motor 
tasks but do not affect measures of the recall 
memory. This, plus the evidence from the 
verbal learning literature, provides support 
for the hypothesized separation of recall and 
recognition processes in motor skills. 
ith Practice 


of the theory 
ease with KR 


Increased Error Detection W 


_ One of the main predictions 
is that error detection will increas a 
Practice, and Schmidt and white (1972) 


and others since then (Newell, 1974; Newell 
& Chew, 1974; Schmidt & Wrisberg, 1973) 


have shown this quite clearly. In the 
Schmidt and White experiment, subjects 
were given 170 trials of the ballistic slide 
task, with a target movement time of 150 
msec. Subjects provided subjective errors 
prior to the experimenter giving them their 
objective errors (ie, KR). The correlation 
increased steadily over the course of prac- 
tice, as can be seen in Figure 4, with the 
correlations being quite low (around .20) 
in initial practice and rising to around .90 
in later practice. These data show clearly 
that subjects increased their sensitivity to 
the direction of their errors with practice, 
and support the notion that the schema 


for response recognition was increasing in 
strength. 


Error Labeling System 

What is the evidence for such a labeling 
system? Subjectively, it seems rather ob- 
vious that such a system must exist, since 
subjects clearly transfer information from 
the neurologically coded sensory input to 
words about the error, and they can do this 
rather well. Schmidt and White (1972) 
found objective-subjective correlations of 
.90, and subjects were able to guess their 
errors within 7 msec of the actual time 
moved on the average. Thus it seems clear 
that subjects can estimate their own errors 
quite well. 

However, additional data arguing for a 
separate status of the two error signals (sen- 
sory errors versus subjective reinforcement) 
were presented by Newell and Boucher 
(1974). They asked blindfolded subjects to 
20 times to a stop, then 
to estimate the movement distance verbally, 
and then to move to the designated position 
once without the aid of the stop. One group 
estimated the distance in millimeters and an- 
other group in inches, with all subjects hav- 
ing had more experience with the inches 
The guesses in inches (transformed 
) displayed nearly half the 
es in millimeters, while 
the accuracy in moving to the target was not 
significantly different between groups. The 
that the sensory signal aris- 
arison of actual and ex- 
identical for the two 


move a linear slide 


scale. 
to millimeters 
error of the guess 


interpretation is 
ing from the comp 
pected feedback was 
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Ficure 4. Mean objective-subjective (O-S) correlation as a function of 


practice. 


groups, but that the more efficient schema 
for labeling in terms of the inches scale re- 
sulted in a more accurate report of the dis- 
tance. These data, then, provide some sup- 
port for the idea that there exist two states 


of error: raw Sensory signals and labeled 
subjective reinforcement, 


Learning Without KR 


The predictions of the present theory for 


no-KR learning differ considerably for rapid 
tasks (ie, with movement time less than 
200 msec) and slow positioning tasks. The 
categories of movement are discussed in 
turn. 

Positioning tasks. The theory hypoth- 
esizes that the subject develops a recognition 
schema over the course of practice with KR, 
and that he uses the comparison of actual 
feedback and the expected sensory conse- 
quences generated on each trial as the mech- 
anism by which he arrives at the correct 
location. The evidence is quite clear that 
without postresponse error information in 
the form of KR (and without Presenting the 
correct movement on each trial), no learning 
can occur without KR; this conclusion js 


Abbreviation : KR = knowledge of results. 
an Error Detection Mechanism in Motor Skills: A Te 
Loop Theory" by R. A. Schmidt and J. L. White, J, 


(From “Evidence for 
st of Adams’ Closed- 
ournal of Motor Be- 
Publishing Affiliates, 


supported by a number of investigations 
(eg., Bilodeau et al., 1959), and the evi- 
dence has been summarized by Bilodeau 
(1966). This is entirely consistent with the 
present theory, since the Sensory conse- 
quences of the correct response are required 
to strengthen the recognition schema, an 
the subject has no way of learning the cor- 
Tect location without the guidance provided 
by KR. 
Adams (1971) suggested that subjects 
could continue to learn these responses with- 
out KR after sufficient KR practice was pro 
vided, during which the perceptual trace 
(the expected sensory consequences in the 
Present terms) was strengthened. The prem 
ent theory does not predict this, and 2 
are no data to suggest that such should pR 
the case in linear positioning tasks. jsp d 
retically, after the recognition schema 15 is 
veloped, withdrawing KR provides a ae 
Or Continuing to respond with the for™ A 
level of accuracy, but any improvement, m 
cept by accident, is not possible. Suo. 
use the recognition memory to produce S 
movement, attempting to match their nt 
feedback with the expected sensory COP 


“aa 
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quences, but as discussed earlier, their sub- 
jective reinforcement indicates that they had 
moved to that position for which they re- 
ceived a match and for which the subjective 
reinforcement was necessarily zero. Hence, 
their subjective reinforcement is not capable 
of signaling their actual movement distance 
on that trial, and the substitution of subjec- 
tive reinforcement for the withdrawn KR is 
ineffective. Hence, after the movement sub- 
jects have the actual sensory consequences to 
store for use in the updating of the recogni- 
tion schema, but they do not have the other 
essential ingredient—the actual outcome. 
Therefore, no further development of the 
schema can occur, and learning does not 
continue after KR withdrawal. These pre- 
dictions are upheld by a number of separate 

investigations reviewed by Adams (1971). 
The reader should take cautiously the con- 
clusion that if postresponse error informa- 
tion in the form of KR is not available in 
such tasks, learning cannot occur, for there 
are two studies that show clearly that learn- 
Ing can occur without KR under certain con- 
ditions. Solley (1956) used a motor-driven 
tilt-chair that the subject could operate by a 
two-way finger switch. The subject was 
firmly strapped to the chair and the head 
was immobilized with a bite-bar. He was 
powered to the vertical position, whereupon 
Then the 


the experimenter said “vertical.” 
experimenter tipped the subject 30? to the 
left or right, and the subject attempted to 
replace himself in the vertical position with 
appropriate switch movements. When the 
subject had achieved what he thought was 
vertical, Solley “randomly” moved him 
about to disguise the relationship between 


vertical and the position to which he had 
actually moved, and then finally moved him 
to vertical again and again said “vertical.” 
The subject was then tipped 30° and at- 
tempted to again replace himself in the ver- 
tical. Over the course of 30 such trials, the 
absolute errors in arriving at vertical de- 
creased markedly and significantly, and this 
Was achieved without ever providing direct 
error information. The crucial difference 
between this study and those referred. to 
earlier jg that the correct location was pre- 
Sented on every trial, and apparently this is 


T6 7-12 
TRIAL. BLOCKS 


Ficure 5. Mean constant and variable errors 
(CE and VE) as a function of practice. (From 
“A Note on Improved Motor Performance With- 
out Knowledge of Results" by C. A. Wrisberg 
and R. A. Schmidt, Journal of Motor Behavior, 
in press. Copyright 1975 by Journal Publishing 
Affiliates. Reprinted by permission.) 


all that is necessary to acquire the correct 
movement. 

Wrisberg and Schmidt (in press) dupli- 
cated the Solley study with a linear position- 
They presented the correct loca- 
having the subject move 
vaited either 5 or 50 sec 
before having him attempt to move to this 
position without the stop. Subjects repeated 
this trial sequence of the correct movement 
plus an estimation 12 times to the same 
target, and KR was never provided as to 
the response correctness. The results of the 
study are shown in Figure 5, where the con- 
stant errors and variable errors are pre- 
sented for two successive 6-trial blocks. 
Both constant error and absolute error (not 
graphed) decreased significantly with prac- 
tice, with less undershooting as a function 
of trials; variable error decreased as well, 
but the differences just failed significance. 
These two studies provide evidence that sub- 
jects can learn slow positioning tasks with- 
out postresponse error information, provided 
that the correct location is provided on each 


trial. Adams and Dijkstra (1966) pro- 
duced similar findings in linear positioning. 


ing task. 
tion on each trial by 
to a stop, and then v 
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The present theory can handle these find- 
ings quite easily. When the subject is pre- 
sented with the correct movement, he stores 
the sensory consequences along with the de- 
sired outcome, and the recognition schema 
begins to be developed. Then when he at- 
tempts to reproduce the movement without 
the stop, he moves to that position he rec- 
ognizes as correct. On the next trial he 
receives the sensory consequences of the cor- 
rect response again, further updates the 
schema, and reproduces the second trial with 
greater accuracy than the first. KR is not 
necessary to learn because subjects received 
the appropriately paired sensory conse- 
quences and desired outcome (the latter be- 
ing a constant) on each trial, and this is all 
that is needed for development of the recog- 
nition schema, 

Rapid movements. The predictions about 
learning without KR in rapid tasks are 
somewhat different from those in position- 
ing tasks. Theoretically, the movement se- 
quences are carried out totally open loop 
in the rapid tasks, with recall memory hay- 
ing a major role in movement production. 
Recognition memory and the resulting sub- 
jective reinforcement occur after the move- 
ment, since these feedback loops are con- 
sidered quite slow. Therefore, in Such tasks 
there is the capability of detecting one's 

errors after the movement so that correc- 
tions can occur on the subsequent movement. 
Even so, however, considerable initia] 


cannot 


predict learning “from Scratch” without KR 


except perhaps in the situations where (a) 
the correct movement is presented in a way 


analogous to the method used in the Solley 


and Schmidt 
task to be learned 
Practiced class of 


1 à strong schema 


eloped. There is 
apparently no evidence concerning the for- 


mer situation, but the latter Possibility has 
Some support from Henderson (1974), who 
showed (with experienced darts players) 
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improved performance in dart throwing 
without any KR or other feedback infor- 
mation. 

After sufficient practice with KR, the sub- 
ject not only should be able to continue to 
perform effectively under KR-withdrawal 
conditions, but also should be able to con- 
tinue to learn the movement. This predic- 
tion, however, has been quite difficult to 
support because of a number of methodolog- 
ical problems. In order that the task have 
strong recognition memory, it should pre- 
sumably be simple enough that the recogni- 
tion memory can be developed within the 
time constraints of the typical experiment. 
However, with such simple tasks, the 1m- 
provement in performance is quite rapid, 
with nearly asymptotic conditions reached in 
very few trials (eg., Schmidt & White, 
1972; Schmidt & Wrisberg, 1973). If KR 
is now withdrawn, subjects have no room 
for improvement, because errors are already 
at an effective floor in the task, Thus, at- 
tempts to demonstrate continued learning 
without KR have been almost universally 
unsuccessful, An example is the Schmidt 
and White (1972) study in which KR was 
withdrawn after 150 trials in a 150-msee 
movement-time task ; 
indication of a contin 
In absolute errors, the 
and was not statisticall 

However, an alternative approach has 
Proven useful. Schmidt and McCracke? 
(unpublished study, 1974) allowed subjects 
to perform 200 trials of rapid movemen 
with a 200-msec movement-time criterio! 
with KR on one day, and then broug” 
subjects back 1 week later, providing e 
additional trials with no KR at any pan 
hrough the forgetting that would pe 
Over the 1-week interval, subjects could We 
turn to the task with elevated error, @ i 
1ng some room for subsequent decreases, i h- 
thus the study investigated relearning v iod 
out KR. The 20 trials on the no-KR P^ e 
Were grouped into blocks of 5 trials a? ab- 


a 


r : m 
presented in Figure 6, The decrease ere 


while there was some 
uing downward tren 
effect was very sma 
y reliable. 


Solute error was Significant, but apa e 
no significant changes in either const? wr 
ror or variable error, While these ce 


a 
Ward trends were significant, the erro" * 


| 
| 


ES 2 this period was considerably larger 
E, : was at the end of initial KR practice, 
E 1atever relearning took place was quite 
there — rather incomplete. Nevertheless, 
i ata do suggest that relearning can 
hes IRURE KR when initial KR practice 
een provided (although, strictly, à 
Warm-up interpretation is possible), and 
they support the theoretical expectations 
about no-KR learning. 
e interpretation is that the recognition 
recall schemata were retained to some 
extent over the 1-week layoff, but that they 
both suffered considerable weakening. The 
d on the initial no-KR trial was quite 
arge, suggesting that the choice of the re- 
sponse specification was degraded by for- 
Setting. Even though the schemata had 
undergone forgetting, it appeared that suffi- 
cient subjective reinforcement was available 
5 allow improvement over the next 19 trials. 
ias d work needs to be done using more 
ensive practice with KR, and with more 
no-KR trials to determine if this prediction 
might hold more strongly if given the 
chance. 


Response-Produced Feedback and 
Recognition 


The present theory hypothesizes that the 
development of the recognition schema, and 
hence the capacity to detect errors either 
during (slow responses) or after (rapid re- 
sponses) the movement, is dependent upon 
the pairing of the actual outcome (based on 
KR) with the sensory consequences of the 
movement. Attempts to manipulate proprio- 
ceptive feedback in motor tasks have been 
fraught with difficulty arising from either 
unknown effects of direct degradation of 
feedback, as in the “cuff technique” (Laszlo 


& Bairstow, 1971), or by the confounding 
ics of controls 


effects of changing the dynam! 

(eg., Bahrick, 1957). If the manipulation 
of feedback can be argued to change the 
ired of the subject, then one 


movement requ ) 
aded error detection 


cannot be sure that degr 
after such treatments was due to the effects 


on the recognition schema or due to the fact 
that subjects had been attempting to learn 
a different task. Given these limitations, 
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Mean absolute error in relearning 
f results (from Schmidt & 
d study, 1974). 


FIGURE 6. 
without knowledge o 
McCracken, unpublishe 


however, there are à few studies that link 


feedback and error detection. 

Within the framework of Adams’ (1971) 
theory, a number of authors attempted to 
show that error detection and error correc- 
tion were enhanced by the enrichment of 
feedback and practice. Adams & Goetz 
(1973) showed that providing vision, audi- 
tion, and increased force cues in linear posi- 
tioning (as compared with no vision and 
audition and minimized force cues) in- 
creased accuracy in detecting and correcting 
Practice was effective in increasing 


errors. 
the recognition processes as well. Adams, 
Goetz, and Marshall (1972) have provided 

The interpretation is that 


similar findings. 
the recognition schema us 
tive information to form t 
enhancing this information ren 


more effective. 
With more rapid movements, Schmidt & 


Wrisberg (1973) showed that eliminating 
vision and audition had no effect on per- 


formance when KR was present, but that 
when KR was withdrawn after 20 KR trials, 


the groups without vision and audition re- 
their performance to a greater 
a groups with these information 
channels present. The implication was that 
KR practice when vision and audition were 
present permitted the establishment of a rec- 
ognition schema based on these channels, 
and that no-K R practice then allowed sub- 


jects the opportunity of using subjective re- 


es the propriocep- 
he rule, and that 
ders the rule 


gressed in 
extent thar 
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inforcement based on vision and audition; 
groups without vision and audition in KR 
practice regressed more because of less ac- 
curate subjective reinforcement. An inter- 
esting finding was that while earlier experi- 
ence with vision and audition had strong 
effects on KR withdrawal performance after 
20 trials, there were no such effects after 75 
trials, suggesting that the channels of infor- 
mation used by the recognition schema may 
change somewhat as practice continues; vi- 
Sion and audition may have been important 
in earlier error-detection performance, and 
perhaps proprioception had become more im- 
portant later. There is certainly more room 
for experimentation on this question. 


Rote or Errerence Copy 


An important line of thinking in motor 
skills research today is the notion of effer- 
ence copy. There are a number of points of 
view about how it might operate and the 
issues are definitely not settled. The role 
of efference in the present theory, therefore, 
is somewhat unsettled also, and this section 
attempts to review some of the important 
issues and to explain the thinking that led 
to the present position taken on efference. 
Space does not permit a thorough treatment 


of this area, but for reviews consult Gyr 
(1972) or Jones (1974). 


Evidence for Efference 


The notion of efference Copy be 
the eye-movement work of H 
and later Holst (1954). Th 
gued that for accurate visua 
Was essential that the centr 
have information about 
Sent to the muscles tha 
Without this information, 
are ambiguous, in that the 
know whether a shifting vis 
that the eye was moved in 
ment or whether the eye w 
moving environment. 


(1954) to propose t 


Was stationary in a 
This finding led Holst 


hat an image of the 
motor commands (an efferent Copy or corol- 


lary discharge) is “fed forward” to modify 
the incoming visual sensations to Sigur. 
sate for the fact that the eye was ordered to 
move. This contention has been Strengthened 


N 
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recently by neurological evidence that such 
pathways exist (e.g., Chang, 1955; Li, 1958). 

From this basic idea for visual perception, 
a number of other versions of the idea were 
created, and the application was made to the 
perception and production of movement. 
One major problem with the idea of effer- 
ence for movement control is that the same 
term has been used to represent a nome 
of very different ideas. There are at leas 
three ways that the term has been used 1 
the literature. 

Efference as a feed-forward process. The 
most general, and perhaps most widely r 
cepted, version of efference copy is that ! 
consists of information that is fed forwar 
before or during a movement that prepares 
the system in some way for the -— 
various response-produced feedback signals. 
The feeding forward of gamma efferent 1n 
formation to the spindles so that the move 
ment is carried out as ordered, and the in^ 
formation fed forward so that accurate Vi- 
sual perception can occur are examples ° 
this kind of idea, Generally speaking, SUC? 
processes are thought to occur at a numbet 
of levels within the central nervous syste! 
There is no necessity that the informatio” 
fed forward be a literal copy of the motor 
commands sent to the musculature, as with 
the following two models ; 

The Holst idea, In the original statement 
of this model, a litera] copy of the motor 
commands sent to the eye muscles is fed a 
Ward to modify the incoming visual inform? 
tion from the retina (Holst, 1954), enabling 
the accurate interpretation of the visual s 
nals. The idea has been extended to mor 
ment control by assuming that an eter 
Copy of movement commands is comp Sd 
against the inflow of proprioceptive E 
back, with resulting mismatches rep 
Ing an error in the movement. Jones t: pe- 
has termed this idea the “inflow model rio 
cause it depends upon the inflow of proP 
ceptive feedback. 

One problem with this inflow mode 
that the Proprioceptive feedback an s quit 
ence copy are in different “languages of 
proprioceptive feedback coded in term yed 
the movements of joints, and efference scl” 
im terms of the commands to the mU 


tg 
er^ 


| 
| 


255 


Moror SKILL LEARNING 


ture. With the information in different 
sg how, strictly speaking, could these 
red age match? A solution often 
throughout socie ae uod ins 
Cie cen qe. ystem, pr this begs 
7 yu n, as it does not provide testable 
ssertions about how such recoding is ac- 
complished. i 

A second problem, though, concerns the 
fact that such a mechanism cannot provide 
information to the subject about the extent 
to which he has achieved the environmental 
goal for the movement; it can only signal 
the extent to which the chosen movement 
was: carried out as planned. As discussed 
earlier, this type of model cannot account 
for the evidence dealing with the detection 
of errors in movement (eg. Schmidt & 
White, 1972) or the relearning without KR 
(Schmidt & McCracken, unpublished study, 
1974). 

The Jones idea. Largely because the in- 
flow model depends on proprioception, Jones 
(1974) rejected this notion and argued in 
favor of what he terms the “outflow model” 
where a literal copy of the efferent com- 
mands is monitored centrally without com- 
parison with proprioceptive feedback. Ac- 
cording to the argument, knowing what the 
limbs were commanded to do (and knowing 


that the limbs will carry out these com- 
is sufficient information to be able 


mands) 
to determine where the limbs are after the 
movement. Hence, this version of efference 


has been used to explain the perception of 
limb locations in space. This basic idea, 
came from the eye-movement work 
where the loads on the eye musculature were 
nearly constant, permitting one to have in- 
formation about the eye’s location from 
knowledge of the efferent signals (e.g., 
Festinger & Canon, 1965). But the loads 
on the skeletal musculature are not nearly 


as predictable as they are in the eye, and it 
limb location 


is difficult to imagine how 
could be sensed solely from the record of 


the efferent commands. . 
In spite of the argument concerning the 


unpredictability of limb loading, some recent 
evidence has been interpreted in terms of an 
outflow model, because of the apparent dif- 


ficulty of feedback mo for the 


however, 


dels to account 10 


findings. These concern (a) the learning of 
movements in deafferented animals (e.g. 
Taub & Berman, 1968), and (b) the correc- 
tion of movement errors in less than one 
reaction time (eg. Angel, Garland, & 
Fischler, 1971). 

Taub and Berman (1968) used monkeys 
that were deafferented from the head down 
in a shock-avoidance experiment. They pre- 
vented the animals from seeing their limbs 
by the use of a large collar extending out- 
ward from the neck, secured them into à 
chair-like structure, and taped a pneumatic 
bulb into the palm of one hand. When a 
shock was administered to a normally af- 
ferented portion of the head, the animals 
learned to turn off the shock by squeezing 
the bulb. The fact that the animals could 
learn a hand movement without any feed- 
back from the periphery was surprising to 
many theorists, and the leading interpreta- 
tion has been in terms of the outflow notion. 
According to the argument, (a) feedback is 
essential to learning, (b) there was no sen- 
sory feedback, (c) outflow or efference copy 
could provide information about the com- 
mands issued, (d) therefore, outflow was 
the mechanism that mediated learning. 

The second line of evidence has used two- 
choice step-tracking tasks, in which the sub- 
ject moved in one of two directions to @ 
target as quickly as possible after a stimulus. 
Occasionally, the subject would make an 
error, in that he would initially move in the 
improper direction. When this happened, 
he was to reverse his direction as quickly 
as possible, and move back in the correct 
direction. Angel and his colleagues (Angel 
et al., 1971; Angel & Higgins, 1969; Hig- 
gins & Angel, 1970) and Megaw (1972) 
found that the latency of these corrections 
(the time from the first movement in the 
wrong direction until the subject began to 
correct his movement) was sometimes very 
short (from 60 to 90 msec), clearly less than 
Because feedback from 
the periphery is too slow to cause these cor- 
rections, it was reasoned that subjects de- 
tected their own errors before the movement 
was begun through monitoring of efference 


one reaction time. 


copy: 
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The present theory sees efference copy as 

necessary to provide two kinds of informa- 
tion. First, the subject must know that he 
has executed a motor command so that the 
information he receives from the periphery 
can be interpreted as resulting from active 
rather than passive movement. Without be- 
ing at all specific about the details, we as- 
sume the existence of some central mecha- 
nism that records that a motor program has 
been run off; no literal copy of the muscle 
commands is assumed to be stored. Second, 
the notion of efference as a feed-forward 
process is assumed in the present theory. 
Before the movement, the expected sensory 
consequences are generated from the recog- 
nition schema, and this information is fed 
forward so that the incoming response-pro- 
duced exteroceptive feedback and proprio- 
ceptive feedback can be evaluated properly, 
resulting in the detection of error. In addi- 
tion, the motor program output is thought 
to contain gamma efferent signals that are 
fed forward to the muscle spindles, prepar- 
ing the local musculature for reflex-based 
corrections that ensure faithful execution of 
the program. Further involvements of ef- 
ference copy are not proposed in the present 
theory. „The evidence cited above apparently 
supporting the outflow model (ie., learning 
with deafferentation and rapid error correc- 
tion) can be handled with the present theo- 
retical structure, 

In the deafferentation Studies, the mon- 
keys had all that was necessary to learn the 
avoidance response without efference. When 
the shock came on, the animals began a 
series of random movements in an attempt 
to escape. Eventually, quite by accident, the 
animal squeezed the bulb and the shock 
ceased. According to the present theory, at 
this time the animal h : 


[ i ad stored both the re- 
sponse specifications (for the random move- 


ments ordered) and the actual outcome (the 
cessation of the shock). Which specification 
was associated with shock cessation was not 
clear, however, since many responses were 
being ordered at nearly the same time, but 
the schema rule notes the relationship be- 
tween the various response specifications and 
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the cessation of the shock. On the next 
shock trial, the many response specifications 
related to the desired outcome were ex- 
ecuted, and eventually the hand squeeze oc- 
curred again, strengthening the relationship 
between the response specifications and the 
outcome. Hence, the animal could learn 
without direct feedback from the limb, be- 
cause he had the essential ingredients on 
each trial to form the schema rule: the ac- 
tual outcome and the response specifications. 
The logic presented here is similar to 
Guthrie’s (1952) notion of stimulus-re- 
sponse contiguity. The notion of efference 
Copy as a central feedback loop indicating 
the limb movements is not necessary to ex- 
plain these findings. 

The error correction studies can also be 
explained by the present theory without 1m- 
volving efference. In these studies, the re- 
action times for correct responses (the inter- 
val from stimulus until first movement) was 
greater than for incorrect responses, Sug- 
gesting that subjects were anticipating 0n 
error trials. Theoretically, on an error trial 
the subject incorrectly guesses the direction 
of the upcoming movement and initiates TC 
sponse specifications in advance, with actua 
movement not beginning until after the stim- 
ulus. Before the program is initiated, the 
Subject generates the expected sensory con” 
Sequences, which include the expected t 
the guess is Correct) vision from the stim- 
ulus array. When the (unexpected) stir 
ulus comes on, the incorrect program is al- 
ready being run off when the subject Te 
ceives a mismatch between the expected gre 
actual sensory consequences, When the n1*" 
match occurs, a visually based correctio" 
begins and the program for the corten 
movement is initiated. However, we 
the first (incorrect) program must run} 
Course for one reaction time or longer: aut 
beginnings of the incorrect movement yc 
before the correction. This explanation Lie 
dicts that the time from the stimulus pe 
the first movement in the correct die 
should approximate one reaction time. Th 


+ gini 
values were 367 msec (Angel & Hige 96 
1969), 292 msec (Megaw, 1972), auc - 


msec in a recent study by Gordon (19 


ee ee aaa 


a a d 


all of which are well within the normal range 
of reaction time. i 
pos Te ros Pa uses efference 
Por fane inna that informs the sub- 
End rom motor program has been initiated, 
E nes ots various feed-forward pro- 
rr" i eeding forward the expected pro- 
Eso = and exteroceptive feedback is es- 
dnd th or the accurate detection of errors, 
Pda: p of gamma efferent signals 
Bon pios musculature for subsequent ac- 
aud dere is no place in the theory, a5 
echar y stated, for efference copy as a 
P Me that can allow the subject to 
eben ne errors in terms of reaching the 

= goal, because the current evidence 
can be explained without resorting to such 
à notion. 


PossinrLg FUTURE DIRECTIONS 


7h aile the present theory can solve a num- 
Doi problems with existing theoretical 
mes of view and has strong empirical sup- 
E anme areas, the evidence for it is 
There F generally lacking in other areas. 

are numerous directions 1n which one 
could move, including asking about individ- 
ual differences in schema formation, how 
developmental influences are manifested, and 
how large a given schema might be in terms 
of the range of movements encompassed by 
it. But the answers to some essential ques- 
tions should be sought first to determine 
whether the theoretical ideas presented here 
are worthy of further work. This section 
deals with some of these questions. 

First, we need stronger evidence for motor 
recall and motor recognition schemata. We 
might take comfort that the schema notion 
seems acceptable enough for the visual pat- 
tern-recognition literature, but motor skills 
researchers have been embarrassed before by 
taking the work in verbal learning and ap- 
plying it to motor behavior without a serious 
research effort to determine whether the ap- 
plication is reasonable. A number of meth- 
ods of testing the schema notion for motor 
behavior have been mentioned, including evt 
dence that transfer from a variety of experi- 


ences to a new instance of the same class 1S 
high, that increas in practice 


ed variation 
should increase transfer, and that responses 
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presumed to be governed by the same motor 
program share in temporal and spatial orga- 
nization (e.g., Glencross, 1973). The notion 
oi the schema is clearly essential to the 
theory, and it seems logical that the cred- 
ibility of this idea should be strongly tested 
initially. 

It is possible that it wil 
demonstrate schema learning in adult sub- 
jects, and perhaps a more profitable popula- 


tion would be children. Many researchers 


adhere to the view that most of motor learn- 
d that adult learn- 


ing occurs in children an 
merely a recombination of the old 
it would therefore make sense to 
sition of these new skills at 
develop most rapidly. 
te schema learning in 


adults does not necessarily mean that sche- 
mata do not exist, since the schemata for 
certain classes of highly practical activity 
(such as throwing) could have been devel- 
oped years earlier. 

Another important prediction made by the 
theory is the possibility of learning rap! 
tasks without KR after some initial K 
practice. This has not been shown clearly 
so far, and additional work should be con- 
ducted with longer KR practice periods, 
with tasks in which complexity is higher 
than in the simple slide-movement tasks, and 
with more extended no-KR practice periods. 
In addition, the predictions about tasks in 
which accuracy is not a goal, as well as the 
predictions referring to open and closed 
tasks, need to be tested. 

A final word concerns potential applica- 
bility. While the theory was certainly not 
developed for application to the teaching of 


motor skills, the predictions seen to have a 


great deal in common with the notion of 


movement education. While there are many 
branches of this idea, one of the important 
thrusts is that children should engage in ac- 
tivities stressing variety in movement pat- 
terns (e.g, jump over an object in as many 
ways as possible). Movement education 
people do not talk of schemata, of course, 
but one interpretation of their method is that 
it develops schemata in children. These 
schemata will then be useful in an open-skills 
situation where a jumping response is re- 


] be difficult to 


ing is 
habits ; 
study the acqui 
the age when they 
Failing to demonstra 
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quired. This area has been devoid of theory 
for a long time, and perhaps the present 
views can be of assistance. 
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memory system that results from the per- 
ception and encoding of an event. It con- 
Stitutes a necessary condition for the sub- 
Sequent retrieval of information about the 
event, A long-standing problem in the study 
of memory concerns the nature and prop- 


erties of memory traces. 
In a typical laboratory situation a person 
Sees or hears à familiar word, which he is 
asked to store in memory. The perception 
of the word-event produces a change in the 
system, that is, it creates a memory trace. 
How do we conceptualize the trace of the 
word-event? How do we define it? How 
do we describe it quantitatively? And how 
do we relate our conceptualization, defini- 
tion, and description of the trace to other 
ideas about memory ? 
This article suggests some possible an- 
swers to these questions by proposing a 
memory 


simple descriptive theory of the n 
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successively probing the target 
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method, and hence the general theory, 


remembered material. A demonstra- 
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Some potential crit- 


ace elements. 
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ieval, and (c) quan- 
of valences (ef- 


defined as relations bet 
and the products of retr 


titatively described in terms 
fectiveness) of retrieval cues and of the 


interrelations of cue valences. The theory 
is embodied in a set of experimental and 
analytic procedures for specifying the struc- 
ture of a memory trace. 

Our conceptualization of memory traces 
as aggregates of elements is in keeping with 
currently popular notions of concepts and 
words being represented in memory as bun- 
dles of attributes or features (e.g. Anisfeld 
& Knapp, 1968; Bower, 1967; Morton, 
1969 ; Underwood, 1969). We discuss trace 
elements, rather than features and attributes, 
solely for convenience in terminology. 

We will not attempt to relate our trace- 
element approach to the other major theo- 
retical tradition in characterizing memory 
representation, namely the associative net- 
work approach (e.g. Anderson & Bower, 
1973; Kintsch, 1974; Rumelhart, Lindsay. 
& Norman, 1972). Our objective here is 
not to present a comprehensive review and 
sis of the accomplishments of the var- 
memory representation ; 
be and discuss one pos- 
f specifying 


analy 
ious theories of 
rather, it is to descri 
sible approach to the problem © 
properties of memory traces. 
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262 ENDEL TULVING AND 
TRACES AS RELATIONS BETWEEN 
OBSERVABLES 


Since memory traces cannot be observed 
directly, they must be described indirectly, 
that is, in terms of things that can be ob- 
served. Three classes of observable entities 
are always involved in remembering: (a) 
the input into the memory system, (b) the 
retrieval query or cue, and (c) the output 
from the system. The relation between the 
input and the output was for a long time 
considered the only source of information 
about the memory trace. For instance, the 
correlation between some variable X (e.g., 
frequency, recency, etc.) and the probability 
of recall or recognition has been regarded 
as evidence that trace strength is a function 
of variable X. Thus, a property of the 
trace, its strength, has been defined in terms 
of the input-output relation. 

There are at least three difficulties with 
the idea of traces as input-output relations. 
First, it encourages the notion of the trace 
as a copy of what has been presented to the 
memory system. The copy theory, even if 
rendered more plausible by assuming that 
the trace is but an incomplete or impover- 
ished counterpart of the original object or 
event, seems intuitively unreasonable. 

Second, a definition of the trace as an 
input-output relation cannot cope with the 
fact that the output from the system varies 
not only with the input but also with the 
conditions under which retrieval is at- 
tempted, _ This difficulty is usually avoided 
by observing the consequences of input only 
under invariant retrieval conditions, Unfor- 
tunately, this Strategy precludes the study 


of some of the most interestin aspect 
S 
i g pects of 


The third weakness o 


f the input- 
relation as Put-output 


aces is 
Tt pre- 


: input conditions 
the trace of an item, If the trace is dead 


as a relation between input and Output, its 
characterization must include a description 
of input conditions. Therefore, the relation 
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between input conditions and the trace 1s 4 
matter of logic, rather than experiment. 

What we would like to have is some way 
of defining traces that would (a) permit us 
to arrive at a picture of the memory of an 
input event that is not a copy of the event, 
(b) reconcile the idea of a fixed trace with 
the fact that its activation will depend on 
the retrieval environment, and (c) foster 
questions about the relation between input 
conditions and the trace of an item whose 
nominal identity is held constant. 

These objectives can be achieved if mem- 
ory traces are defined in terms of the re- 
lation between conditions and products of 
retrieval, or as "a set of question-output 
relations" (Tulving & Bower, 1974, p. 294). 
The basic ideas are simple. Rather than 
thinking of the trace only as an aftereffect 
of the initial registration of an event, We 
think of it as a “cognitive blueprint” that 
specifies the conditions under which the re- 
collection of the event will occur, Then, by 
assuming the encoding specificity principle 
(Tulving & Thomson, 1973)—the principle 
that any retrieval cue is effective only to 
the extent that its informational contents 
match the information contained in the trace 
of the event—we can interpret the observed 
effectiveness of different retrieval cues as 
evidence about the properties of the trace. 
Thus the definition of the trace in terms of 
the relation between the queries directed at 
the system and the output from the system 
leads to the description of the trace prop- 
erties in terms of the effectiveness of various 
retrieval cues in producing recall. , 

Such an approach abates some of the dif- 
ficulties inherent in the definition of memory 
traces as input-output relations. The de- 
scription of the trace can emerge as some- 
thing other than a copy of the input, since 
effective retrieval cues “may be qualitatively 
quite different from the original event. Als? 
there would be no logical paradox about the 
fixed traces having variable manifestations: 
since the logic of the approach includes the 
assumption that the output from the system 
depends on the retrieval information ce 
cue) as well as on the stored informatio" 
(the trace), Finally, this approach encour 
ages questions about relations between inp" 
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E and the resulting traces of one 

e same item. 

deer o part of this article consists of 

n d n and illustration of this ap- 

o " bo. we will first briefly consider 

Sd urrently popular approaches to the 
em of describing memory traces. 


Traces As FEATURE BUNDLES: 
Two Extant METHODS 


Ben different. methods have been used 
a Loy 10us investigators in attempts to get 
tualiz canens of memory traces concep- 
eviews, as feature bundles. In a recent 
E w, Tulving and Bower (1974) dis- 
ussed 11 methods, and interested readers 
may wish to look at that review for a fuller 
background for the present discussion. We 
will mention here only two of the methods— 
those with which ours might be most di- 
rectly contrasted. 

One is Wickens's 
of attribute shifts, 
three or four successi 
tests with the Brown-Peterson technique, 
with the study items sharing some particular 
perceptual or conceptual feature. Recall is 
typically found to decrease rapidly across 
the successive tests. A “critical” test is then 
given, in which the study items do not share 
one or more attributes that the items of 
previous tests had in common. The critical 
test often yields a high level of recall, com- 
pared with that for a control test in which 
no attribute change takes place. Wickens 
has argued that such a shift effect indicates 
that the systematically varied features were 
encoded into memory. 


Although W ickens's attribute-shift method 
Jot of interesting data, it does 
of problems. One is that it 
ite limited applicability. 
d only within the con- 
Brown-Peterson 
cific condi- 


(1970, 1972) method 
which involves giving 
ve short-term recall 


paradigm and under I? gape k 
tions of retention and intertest bes : ^ a 
i nethod 

1S own whether the 1 o 
eg composition of 


d to measur! 


extende 
sorts of inpu 


traces of other 


vals measured in minutes. x. + i 
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specific information to the subject before the 
attribute-shift effect is obtained—the change 
in the attribute is not always sufficient 
(Gardiner, Craik, & Birtwistle, 1972). Thus 
we are faced with the difficulty of deciding 
exactly what kind of retrieval information 
should be provided to identify trace features. 
Still another shortcoming of the method lies 
in the fact that the investigation of each po- 
tential encoding feature involves the use of 
a special set of items, a set which, as Under- 
wood (1972) has pointed out, could serve 
to "prime" the feature under investigation. 
It follows that we cannot be certain that the 
dimensions of encoding suggested in the at- 
tribute-shift data have any generality beyond 
the precise conditions of the paradigm. Fi- 
nally, the attribute-shift method provides no 
easy solution to the problem of determining 
the extent to which any two types of attri- 
pute shifts reflect the operation of a single 
underlying process. This is an aspect of the 
problem of establishing the relative depen- 
dence of trace features, a problem that oc- 
curs in many extant methods of describing 
traces (see Tulving & Bower, 1974). 
Wickens's attribute-shift method is based 
on the implicit definition of the trace as an 
input-output relation. Although the input 
conditions in the method are specified in the 
form of differences between sets of to-be- 
remembered materials, and the output is in- 
dexed by a derived measure involving 4 
comparison between two conditions, the ef- 
fects of an attribute shift, and more pat- 
ticularly the ranking of the relative effec- 
tiveness of several attribute shifts (e.g 
Wickens, 1970, Fig. 7; 1972, Fig. 3); do 
provide a picture of the trace as 2 relation 
between input and output. And, as we have 
seen, such a formulation has serious limita- 


tions. 
The other method with which the theory 


and technique to be developed in this article 
can be contrasted is the method of false posi- 
e recognition errors. It was first described 
by Underwood (1965) and has been used as 
4 tool for identifying component features of 
memory traces by ae p ks 

1968) a5 well a5 many others eg. acl 
à Underwood, 1970; Elias & Perfetti, 1973; 


Kosslyn & Bower, 1974). A person studies 


tiv 
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a list of items (words, sentences, or the 
like), and later takes a recognition test. Be- 
side copies of study items and unrelated dis- 
tractors, the test includes items that share 
particular features with study items. Thus, 
if one of the study items was the word 
CHAIR, the test might include the seman- 
tically related word table and the 
tically related word hare. To the exte 
the person checks these distractor items as 
“old” more frequently than unrelated dis- 
tractors, the features common to the study 
and the distractor items are implicated as a 
Part of the trace of the study item. The 
features are usually designated by such terms 
as “associated words” or “thyming words.” 

The method of false Positive recognition 
errors is based on an implicit definition of 
the trace as a query-output relation. It is 
therefore free of the special difficulties in- 
herent in input-output definitions of the 
trace. But it does have certain other diffi- 
culties. For instance, the experimenter gains 
information about trace features only to the 
extent that subjects make false positive “er- 
rors.” Hence, the evidence for trace features 
depends, among other things, on the factors 
determining the frequency of such errors; 
yet some of these factors, such as the ratio 
of target to distractor items, Probably have 
little to do with the nature of the trace. An 
oblem is that under 
e subjects Strive 
the data that can 
about trace prop- 


tating the testing 
results, 


acous- 
nt that 


why 
actor 


control words in 


about the relations 
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A third problem has been discussed by 
Tulving and Bower (1974, p. 280) and will 
be only briefly mentioned here. Inferences 
about trace features require assumptions 
about features lost from the trace rather 
than features present in the test item. Tf the 
subject checks the distractor word table after 
having seen curam in the list, he presumably 
does so because of an overlap of semantic 
features between fable and cram. But this 
overlap should only produce an “old” re- 
sponse to the extent that other features that 
normally distinguish table and crater have 
been lost from the trace of crar or are not 
extracted from the test stimulus table. It is 
not clear what kind of theory is needed here 
to relate the empirical data to the hypothet- 
ical memory trace, 

Finally, there is again the problem con- 
cerning the relations among features, their 
independence and separability. Assume that 
the word cram has been presented as a to- 
be-remembered item and that in the recogni- 
tion test two distractors related to CHAIR, 


fable and sit, are both checked as false posi- 


tives. Does this mean that the trace of 
CHAIR includes one 


: feature (e.g., associa- 
tive”) or two features (e.g., “furniture” and 
activity-instrument”) ? To rely on the ex- 
Perimenter’s intuitions about the relation be- 
tween two types of distractor items is not 
Very satisfactory, Even where two items 
may appear to be very different (as with 2 
semantically related 


^ word such as table an 
an acoustically related word such as hare)» 
there is little to ju 


stify the conclusion that 
they have nothing in common, Evidence 
between different types 
hould be derived with 2? 
, and so far no such pr?" 
oposed. 


RETRIEVAL CurxG AND THE REDUCTION 
METHOD "m 
Retrieval Cüing as a method for describi? 
memory traces has been briefly discussed - 
Tulving and Bower (1974). It has " 


of distractor items s 
objective Procedure 
cedure has been pr 


pott 


the test, selected to be unrelate 


used as a tool for gaining information 
items, are checked as “old” su 


© composition of memory traces in 2 


mum 


y ks 
Seests that the ber of experi relay, Bray. 

^ quie xperiments (eg, Barclay, 974; 
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epe 1974), primarily by providing 
Ie pre possible trace features or attributes. 
E article, in essence, describes an 
The o~ of the method of retrieval cuing. 
maiis ae consists in the successive 
differ g of a given trace with two or more 
Ciferent types of retrieval cues. By sub- 
tae bai: results of such a procedure to à 
lion analysis that we call the reduction 
Es a, the relations among the retrieval 

es with respect to the trace may be ob- 
And in the context of our theory, 


served. 
i » $ - P 
E: observed cue relationships describe the 
ructure of the referent trace. 
oding 


Central to our approach is the enc 
specificity principle mentioned earlier : A re- 
trieval cue is effective to the extent that its 
ronte oal contents match the informa- 
ee contents of the trace. Thus we can 
: er informational properties of a particular 
à NE quantitatively specify its constit- 
A race elements by measuring the effec- 
tiveness of different retrieval cues. 

, At this point it is useful to introduce the 
idea of the valence of a cue, or more pre- 
cisely the valence of a cue with respect to 
Some particular trace. In fact we will define 
three types of valence: gross valence (often 
abbreviated to valence), reduced valence, 
and common valence. The gross valence of 
a cue with respect to the trace of a certain 
event refers simply to the probability with 


which that event can be recalled in the pres- 
In typical experimental 


ence of the cue. 
situations where the subject's task is to re- 
produce the name of a word-event, gross 
valence refers to the probability of recall of 
the word in the presence of the cue. The 
valence will therefore assume à value be- 
tween Zero and pe T valence E "im 
„ays be specified, at least implicitly, 
must alway : me defined trace, for the 
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and cue Y, refers to the probability that the 
trace can be retrieved by cue X as well as 
by cue Y. It follows that the gross valence 
of cue X equals the sum of the reduced (by 
cue Y) valence of cue X and the common 
valence of cues X and Y. 


Successive Probes 


Consider now a simple experimental situa- 
tion in which a subject is shown a familiar 


word among others in a to-be-remembered 


list. After the presentation of the list the 
subject is given two temporally separated 
cues for the target word in question, and 
asked to use the cues in recalling the target 
word. For instance, if the target word is 
cHAIR, the subject could be given the ex- 
tralist cue table, perhaps with a statement of 
the conceptual relation of the cue to the 
target word. Thus, the complete cue might 
be “table (associated with) = Some 
time later in the test the subject is given 
another cue, such as “hare (rhymes with) 
” Of course, additional probes for the 
same target word could also be given, but 
we will consider only the situation in which 


two probes are used. 

If these simple operation, 
word and probing it with tw 
cues are replicated many times, 


s of presenting a 
o successive 
frequencies 


of various possible outcomes can be used to 
estimate the valences of the cues. To do 50, 
of a 


the data are summarized in the form 
2 x 2 contingency table. One such data ma- 
trix from an imaginary experiment is shown 
in Table 1. Here one cue, X, was always 
presented as the first probe, and another cue, 
Y, as the second. The data in Table 1 tell 
us that cue X effected recall of the target 
item 45% of the time, and cue Y 60% of the 
time. We can also see that 40% of the time 
the target word was recalled to both cues, 
5% of the time only to the first cue, and 


hoan o 
with reference to $0 : 
val fa cue will vary widely for traces n 
valence OF * event—even among 20% of the time only to the second cue. 
of different types 9 «| of the same Using our terminology, we can say that the 
involving the presentation S l of cue X was -45 and of cue Y 
events invo ving onditions- Reduced goss valence us ne ot aped] 
iem under different € alence that has been was 60, ie ne Me e E 
va e Y was 40, 2 a aler 
valence refers t0 * ne A e valence of and ^s "educed by cue Y was 05, while the 
iher Cue- ) 
reduced by some other *'y is gwen py the cue ^ Te ence of cue y was 20. “a 
Cue X reduced bY cue pn cece’ to reduceo e uit encoding specificity 
X. iarget C2! WR 1 : es 
Petitur Tem ] called to cue Y.* bs apie vt cegasd the pattern oi anew 
e reca " minc 
ue X and cannot he chus, h as € X m 
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TABLE 1 


Gross, COMMON, AND REDUCED VALENCEs OF Two 
Curs: HYPOTHETICAL DATA 


Second cue Y 


First cue X * = Total 
"E 40 05 45 
as .20 35 o5 
Total 60 40 


lences shown in Table 1 as describing the 
trace of the target item. In other words, 
we have described the trace here in terms of 
the gross, common, and reduced valences of 
the two cues. Since the common and re- 
duced valences are not predictable from the 
gross valences, the whole pattern of data 
tells us more about the memory trace than 
do the gross valences alone. Indeed, we 
might say that the pattern of the two va- 
lences tells us something about the structure 
of the memory trace. 

To formalize the distinction between va- 
lences of cues and the properties of the trace, 
we will refer to components of the trace as 
elements and define them in terms of re- 
trieval cues. Thus, Table 1 can be said to 
depict a trace containing 40% XY elements, 
5% of XY elements, 20% of Xy elements, 
and 35% XV elements. 
ine a situation in which the gross valences 
of cue X and cue Y are the same as those 
shown in Table 1, but the cell entries in the 
matrix are quite different. We would then 
say that even though the two traces had cer- 
tain properties in common, their informa. 
tional structures were different, 


Reduction Method 


Although the basic idea u 
method of retrieval cuing for 
describing memory traces is s 
plementation is a bit more 
Complications arise because the 
the two cues, X and Y, may de 
order in which the 


It is easy to imag- 


nderlying the 
quantitatively 
imple, its im- 
complicated, 
valences of 
pend on the 


y are presented. This 
would be expected if the first cue changed 


the trace (retrieval-induced recoding of the 
trace), since the two cues would then þe 
applied to different traces. If it were not 
for such potential confounding, we could let 
the data pattern in Table 1 represent the 
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structure of the trace. But given that it is 
indeed possible, if not probable, that prompt- 
ing recall of an event with one cue will 
change the valence of a second cue, another 
method for describing the trace is required. 
Our response to this requirement has been 
the development of the reduction method. 
Consider again the basic experimental sit- 
uation that we have described: A trace of a 
certain list item exists in memory and is 
probed successively with two cues, X and Y. 
But now the two cues are applied to the 
trace in two different cue-order conditions. 
In one condition the target is first probed 
with cue X and then, after other interpolated 
tests, with cue F ; in the other condition the 
order of the two cues is reversed. Data 
from the two cue-order conditions are tabu- 
lated separately in the form of two 2 x 2 
tables. The data from these two tables are 
then combined into a third table, one that 
directly describes the trace structure, 
. Table 2 shows a set of data from an imag- 
mary experiment in which gross, common, 
and reduced valences of two cues, X and Y, 
are obtained in two cue-order conditions as 
Just described. The 2 X 2 table on the left 


(identical to Table 1) shows the results for 
the condition in Which cue X was presented 
first and cue y 


i second, and the middle 2 x 2 
table depicts the Tesults with cue Y pre- 
sented first and cue X second. We will 
refer to these tables as data matrices, The 
right-hand 2 x 2 table, which we will call 
the trace matrix, is composed of selected 
data from the two data matrices. As Table 

Shows, only two entries in each data ma- 
trix are used in constructing the trace ma- 


i the gross valence of the first cue and 
the reduced valence of the second. The re- 
maining entri 


n es in the trace matrix are ar- 
rived at by subtraction, in keeping with the 


Properties of 2 x 2 contingency tables. 

The trace matrix is constructed in a way 
that Minimizes the effect of cuing order and 
Possible retrieval-induced recoding of the 
trace. Our basic assumption is that present- 
ing a cue might cause a recoding of the 


target trace only if the cue is successful in 
effecting recall; a cue that does not elicit 
recall of the target item is assumed to leave 


the trace intact, Thus, in describing th* 


Tr 
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TABLE 2 
Two Dara MATRICES AND RESULTANT TRACE MATRIX: HYPOTHETICAL DATA 


Data matrices 


Trace matrix 


Second cue Y Second cue X Cue Y 
First cue X First cue Y Cue X 
+ - Total + = Total + æ Total 
+ 40 .05 45 + 27 .25 .52 + SE 5g 45 
> 20 35 99 - J3  .35 48 - 20 435 459 
Total 60 .40 Total 40  .60 Total 52 48 


Note. Boldface indicates 
constructing trace matrix. 


trace we must take care not to use data that 
may be contaminated by the effects of a suc- 
cessful first cue. As Table 2 shows, no such 
contaminated data from the data matrices 
are used in constructing the trace matrix. 
The gross valence of the first cue is, of 
course, free of any contamination by the act 
of retrieval, and so is the reduced valence of 
the second cue, since this is based on only 
those cases in which the first cue did not 
effect recall.' 

Two things should be noted about this 
method of estimating the gross, common, 
and reduced valences of the two cues with 
respect to a given trace. First, since we 
wish to avoid the potential problem of a 
change in the structure of a trace as a con- 
sequence of the act of retrieval, and since we 
therefore must exclude the data based on the 
retrieval of items previously retrieved, it is 
necessary to derive a trace matrix from two 
separate data matrices, each corresponding 
to one of the two possible test sequences. 
Provided our assumption is correct, this 
procedure gets around the difficulty created 
by the possible effects of retrieval on the 
trace. The effects of retrieving a trace on 


one occasion on retrieving it on a subsequent 
1 The question has been raised whether the de- 
rived trace matrix represents the state of the trace 
at the time of the first, probe or at the time of the 
second probe. There is no satisfactory answer to 
this question. Since the information in the memory 
Store about any particular item probably changes 
in the interval between the two probes, our methoc 
can only be said to reveal the state of the trace as 
it "exis" at some time in the interval. The length 
of the interval between the two probes, of course, 
Can be manipulated jn experiment and the accom- 
banying changes in the trace structure observed. 


two entries from each data matrix (gross valence of first cue; reduced valence of second cue) used in 


occasion represent an interesting research 
problem, but whatever these effects are they 
are not relevant to the reduction method. 
The second point has to do with the prob- 
lem of combining the data from the two data 
matrices into a single trace matrix. In the 
illustration shown in Table 2, the critical 
values for the two data matrices were en- 
tered directly into the trace matrix. Such 
direct transfer of observed data is possible 
if and only if the XY cells (probabilities of 
recall of target to neither cue) in the two 
data matrices are identical, as they are in 
Table 2 (.35 in both cases). But even if 
our basic assumption—that the proportion of 
items recalled at all (and hence the propor- 
tion not recalled at all) is independent of 
cuing order—is valid, small measurement 
errors or “noise” effects are to be expected. 
The critical entries in the data matrices will 
therefore usually require aligning before 
they can be combined into a trace matrix. 
The most appropriate method of adjusting 
the data is simple to carry out, but in order 
to describe it unequivocally it is necessary to 
adopt a somewhat formal terminology. Con- 
sider the two data matrices and the corre- 
sponding trace matrix presented in Table 3. 
"These matrices are shown 1n two forms, one 
in general symbolic notation, the other con- 
taining numerical data from a hypothetical 
experiment. The symbols X and Y refer to 
cues X and Y, and subscripts 1 and 2 indicate 
the order of presentation of the two cues. A 
plain cue symbol indicates successful recall 
of the target in the presence of the desig- 
nated cue, while a symbol with an overbar 


indicates unsuccessful recall. Valence is in- 
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dicated by the symbol V. Thus, for in- 
stance, (Y) represents the gross valence 
of cue Y presented first, V(Xi;Ys) repre- 
sents the reduced valence of cue Y, where 
cue X was presented first, and so forth. In 
the following discussion, the symbol V is 
omitted as redundant. In the trace matrix, 
the symbols X and Y have no subscripts, 
since they represent proportions of trace ele- 
ments independently of cuing order. For 
instance, X Tepresents the proportion of 
non-X elements in the trace, XY represents 
the proportion of XY elements, and so forth. 
There is, of course, no need to adjust all 
the entries in the data matrices, but only 
those (critical) entries that are to be trans- 
ferred to the trace matrix, The trace matrix 
entries are derived from the critical entries 
of the data matrices by using the following 
equations ; 


XY = (X Y. X.Y1/2 


(1) 
= (38 + .32)/2 = 35, 


X 


Xia — XY)/a — XY)] (Q2 
BEU = .35)/(1 = -38)] = .45, 
Y= YE- Z/a — X,Y)] 
8L — .35)/(1 — 32)] = 51 
= XUY[rü- Y XY 
= 1901 — 35) 774 — 38)] 
XY = 


ll 


(3) 


, 


i] 


2)] (4) 
= 20, 
XY — XY) ~ XY] (5) 
ASE — .35)/(1 — 32)] = 14, 


The equations ar 
Table 3, with c 
decimal places, 


The logic under] 


ll 


e illustrated with data from 
alculations Tounded to two 


ying Equations | to 5 is 
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tion is stated formally in the following equa- 
tion: 


XV. + Xa; + XY, 


= OY: X.Yi-4-XiY, (6) 
If Equation 6 is found to hold, then X;Ys 
will equal XY, and the critical values from 
the data matrices can be entered directly into 
the trace matrix, If Equation 6 does not 
hold, then it is assumed that the observed 
discrepancy between the left- and right-hand 
sides is attributable to sampling variability 
and measurement error. The best estimate 
of the true Proportion of items recallable 
to at least one cue is then given by the 
mean of the two sides or Equation 6, and 


each side of the equation is adjusted to this 
best estimate, It is 
ment and sa 


terms of the 
he adjustment is made 
erm by the ratio of the 
to the value of the side 
l This adjust- 
quation true, and con- 


r adjusting the data in the 
manner just indicated, in practice it is easier 


to work with the quantities ¥ P, and XV, 
hich are the complements of the values of 
i e two sides of Equation 6. This is what 
ee been done in Equations 1-5, The un- 
erlying logic is exactly as described, but 


the calculations indic i 

i ated by E -5 

are simpler, PO 
Actually on] th i 

formulas y free appropriately selected 


given by Equations 1-5 d be 
applied; the other values doe 


simple Subtraction, in kee 
tables. 


can be obtained by 
Ping with the prop- 
In the illustrative 
S well as in most actual 


"S Supporting our basic 
88esting that the violation 
Equation 6 is attributable 


—— n—( 
=—<_ 2€ 
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TABLE 3 
DATA AND TRACE MATRICES 


Data matrices Trace matrix 
. Second cue Y Second cue X Cue Y 
Bist cue n Cue X 
+ — Total T - Total + -— Total 
Symbolic notation 
+ V(Xi¥s) VQüE) VG) + vs) XQUY) VO) T XY XP 
n V(XiY) VQüY3 V(X) = V(XiY,) VQGY) VV) = Xy EY 
Total V(Y:) V(Y Total V(X2) V(X3) Total | ^UE 
Illustrative data 
+ .36 .07 A3 + 232 21 53 + .31 .14 .45 
"d .19 .38 57 - A5 232. AT - .20 .35 .55 
Total .55 AS Total AT .53 Total | .51 .49 


Errkcrs or ENCODING CONDITION AND 
RETENTION INTERVAL ON TRACE 
SrRUCTURE: A DEMONSTRA- 

TION EXPERIMENT 


We are now ready to explore the struc- 
ture of traces created in experimental situa- 
tions. We will describe a demonstration 
experiment in which encoding conditions 
and the length of retention interval were 
manipulated as independent variables and 
then analyze the cued recall data by the re- 
duction method. 


Method 


Subjects were presented with six different lists 
of 16 to-be-remembered words each. Each to-be- 
remembered word was typed in capital letters and 
accompanied by a context word (input cue) typed 
in lower case letters. One half of the context 
words were associatively related to the to-be-re- 
membered words, while the other half were words 
that rhymed with the to-be-remembered words. 
For instance, the associative (A) context for the 
to-be-remembered word CHAIR may have been desk, 
and the rhyming (R) context (presented with the 
same to-be-remembered word to another subject) 
may have been pair, The two types of context 
words defined the two encoding conditions of the 


experiment, and they appeared in the list in an 
essentially random order. Each list was presented 
Visually just once, at the rate of 2 peus pe 
word pair. A short digit task was interpolated 
between the presentation and testing of each usn 

Subjects were asked to study each word pair anc 
cd-recall test, with the context 
Words serving as cues for the to-be-recalled target 


words. In fact, to encourage subjects to continue 
to pay attention to the context words throughout 
the six lists, the target words were indeed tested 
with their respective input cues after the critical 
testing phase of each list; we will ignore the data 
from these input-cue recall tests, since they are of 
no relevance to our present concern. 

The critical data were obtained from the ex- 
tralist-cue tests interpolated between the presenta- 
tion of each list and its test with the input cues. 
In these critical recall tests, each target word was 
probed twice, once with an associative extralist cue 
(e.g. table), and once with a rhyming extralist cue 
(e.g, hare). Subjects wrote their responses in a 
test booklet consisting of a number of pages. Dif- 
ferent forms of the test booklet were used to 
manipulate relevant and control irrelevant varia- 
bles. The first page of the booklet was used for 
recording the subject's name, the list number, and 
the digit string. Subsequent pages comprised the 
cued-recall test. With the exception of the last 
page of the booklet, which contained the input cues, 
each test page contained four rhyming cues and 
four associative cues. Each of the 16 targets was 
tested on two pages of the booklet, once with an 
associative extralist cue and once with a rhyming 
extralist cue. 

Of the 16 targets, 4 were tested o 
pages, 4 were tested on the first an 
the second and fourth, and 4 on t 
fourth pages. This manipulation of the po: E 
retrieval cues defined the length of the retention 
interval, the second main independent variable in 
the experiment. Of each of these subsets of four 
targets, two had been encoded with respect to an 
associative context word and two with respect to a 
rhyming context word. One word in each of these 
pairs of targets was first probed with a rhyming 
cith an associative cue, while for 


cue and second w 
the other word the order of cues was reversed. 


n the first two 
d third, 4 on 
he third and 
sition of 
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TABLE 4 


Tora, Worps RECALLED TO 
FORMAT, CUING ORDER, AND PRESENCE 


Associative (A) AND RHYMING (R) Cues 


AS A FUNCTION or ENCODING 
OR ABSENCE OF FREE-RECALL TEST 


Neither A 

Free-recall odi Only R Only A nor R 

Encoding format Order of cues test (AR) (AR) (AR) (AR) 
Associated word A, then R Yes 231 113 157 267 
No 242 96 159 271 

R, then A Yes 270 62 172 264 

No 278 67 142 281 

Rhyming word A, then R Yes 237 136 97 298 
No 232 126 97 313 

R, then A Yes 234 104 121 309 

No 238 94 126 310 


Note. The cell entries for each of the eight conditions sum to 768 


Thus for a given subject, each target word within 
a particular list represented a unique set of experi- 
mental conditions. The target words and condi- 
tions were balanced within each group of 16 sub- 
jects. 

The nature of the relation of each cue to its 
intended target was explicitly specified in the test 
booklet: For instance, “table (associated with) 


»” or “hare (rhymes with) The re- 
lation between the two cues of a given target, 
however, was never specified, 


subject saw the cue “hare (rhymes with) | — » 


target word sought here was, for ii 
as the one that belonged to the c 
ated with) ___” on page 1. 

An additional experimental variabil 
the presence or absence of a free- 
the extralist-cue recall test just di 
containing free-recall instructions 
between the first page and the 
pages for half of the booklets of 


nstance, the same 
ue “table (associ- 


€ concerned 
recall test before 
escribed. A page 
was interpolated 
extralist-cue test 
each subject, 


tralist cues, 
subject-words and 12288 observations. 


In summary, to-be-remembered words Presented 


Was subsequent] 
two extralist Cues, an associative wy, 
ing word. The positions of the 
cessive pages of the test booklet were s 
ically manipulated, and so was tl 
absence of an interpolated free-re 
data were treated by the reduction 


ystemat- 
1e presence or 
call test. The 
method, 


Results 


The results of m 
effects of encoding 
intervals on memo 


ain interest Concern the 
conditions and retention 
Ty traces obtained with 


» for a total of 6144 subject-targets, 

the reduction method. As a general back- 
ground to these results, Table 4 presents à 
Summary of the total number of words re- 
called in the eight experimental conditions 
defined by the orthogonal combinations of 


\ The data for the free-recall conditions 
include al] words, regardless of whether ot 


not they were produced by the subject in the 
free-recall test, 


hree points are 


: worth noting, The first 
is that the free-rec. 


all test made little differ- 
ence to the pattern or cuing results, For 
each Combination of encoding format and 
order of cues, the data are practically indis- 
tinguishable for lists tested and not tested by 
free recal] first. Thus, an initial free-recall 
test does not, or at least in this experiment 
did not, change the effectiveness of either 
the associative or rhyming cues, 

A second point of interest in Table 4 con- 
cerns the effect of encoding conditions on 
the gross valences of the two kinds of cues. 
“ncoding conditions affected the valence of 
associative cues, but not the valence of 
rhyming cues. The total number of words 


8 conditions; the cot- 
he rhyming cues were 
These data are very SiM- 
ntly reported by Nelson: 


Borden, and Brooks (1974). 


uu 4/ T Ru 


STRUCTURE OF MEMORY 'TRACES 


Finally, Table 4 shows that the violation 
of Equation 6 was negligible. W ithin each 
of the experimental conditions depicted in 
Table 4, the frequencies of responses in the 
AR category (words recalled to neither A 
nor R cues) were quite similar for the two 
cuing orders. 

For the analysis of principal interest, the 
data shown in Table 4 were entered into the 
data matrices, adjusted slightly in order to 
satisfy Equation 6, and then combined into 
trace matrices, one for each of eight experi- 
mental conditions. Table 5 lists for each of 
these conditions the four components of the 
trace, in the form of proportions of the rel- 
evant trace elements. The first four experi- 
mental conditions in Table 5 are defined in 
terms of the orthogonal combination of the 
free-recall and encoding format conditions. 
The trace components for these conditions 
were calculated on the basis of the data listed 
in Table 4. The other four conditions con- 
cern only those subject-lists for which the 
free-recall test was given; they are given by 
classifying the target items according to 
whether they had been produced in the free- 
recall test (some 32% had in fact been pro- 
duced) on the one hand, and according to 
their encoding format on the other. 

The data in Table 5 (a) confirm the im- 
plications in Table 4, both of the lack of any 
marked effect of the free-recall test and of 
the effects of the encoding format on the 
gross valence of the associative but not the 
rhyming cues, and (b) show that the mem- 
ory traces of "more difficult" words, those 
not recalled in the free-recall test, are com- 
paratively weak in the AR component of the 
trace. 

Table 6 is concerned only with target 
words that subjects failed to produce in the 
free-recall test, and shows forgetting in the 
form of changes in trace structure during 
the course of the retention interval. The 
early testing refers to the targets cued on 
pages 1 and 2 of the test booklet, late testing 
to targets cued on pages 3 and 4 5. 

Forgetting is seen to be reflected primarily 


in the reduction of the AR elements. Re- 
duced valences of the two cues do not change 


very much during the testing period, and 
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TABLE 5 


Trace MATRICES FOR TARGET Wonps IN VARIOUS 
EXPERIMENTAL CONDITIONS 


Trace elements 


iti Encoding 

Condition neodlis | s AR AR AR 
No free-recall test given Associative | -19 33 2 .36 
Rhyming 16 27 46 41 

Free-recall test given 
All words Associative | .22 -28 AS 35 
Rhyming "6 .26 .18 .40 
Associative | .25 -55 10 .10 


Recalled words 
g | is 58 46 di 


Unrecalled words ive | 21 . 
Rhyming AY £41 


Note. A refers to associative cues, R to rhyming cues. 


indeed the reduced valence of the rhyming 
cues (the AR component of the trace ma- 
trix) seems to have increased from the early 
to the late tests. The disintegration of the 
trace revealed by the decrease in the propor- 
tion of AR could reflect a transformation of 


AR elements into AR and AR elements at 
the same time that AR and AR elements are 


transformed into AR elements, but for the 


time being this is only conjecture.? 


EVALUATION OF THEORY 


The theory and method for describing 
memory traces that we have outlined here, 
with gross, common, and reduced valences 
of two (or more) retrieval cues used to 
specify trace elements and hence trace struc- 
ture, is really a rather simple-minded one. 
Indeed its simplicity might be regarded by 
some as its major weakness. The theory 
does not predict any phenomena, nor does it 
explain any, at least in the sense in which 


2 The fact that the AR elements of the trace are 
reduced in the course of forgetting and the AR or 
KR elements are not suggests that it is not useful 
to think of the items contributing data to the AR 


cell in the trace matrix as “easy” items or as hav- 


ing been recalled mostly by “good” subjects. Were 
e, the problem would arise 


such identification mad 

of how to explain the forgetting of “easy” but not 
“difficult” items, or the forgetting by "good" but 
not "poor" subjects. Any attempt to identify dif- 
ferent cells in the trace matrix with items differing 
in difficulty or with subjects differing in ability 
also founders on the fact, inherent in the delayed 
test data in Table 6, that the conditional probabil- 
ity of recall to cue X, given failure of recall to, the 
other cue, Y, is higher than the simple probability 
of recall to cue X uncontaminated by amy earlier 


recall attempt. 
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TABLE 6 


Proportions OF Trace ELEMENTS DEFINED BY 
ASSOCIATIVE (A) AND RHYMING (R) Cuzs ron 
Worps Nor RECALLED IN FREE-RECALL 
Test 


Elements 


Condition AR AR AR 


Fa] 
E 


Associative encoding 


Early test 
Late test 


:22 .26 14,38 
49 .07 .24 .50 


Rhyming encoding 


Early test 14 .20 18 48 
Late test AS :.02 .23 357 
many people think of explanation. It rep- 


resents little more than an elaboration of the 
definition of the memory trace in terms of 
the relation between the conditions and 
product of retrieval. 

On the other hand, the theory enjoys sey- 
eral advantages over others, First, perhaps 
because of its simplicity, it can be used to 
define and describe memory traces of all 
kinds of events, using all kinds of retrieval 
cues. We have been concerned here with 
names of list words as the to-be-remembered 
materials, but there are no reasons why 
other aspects of list words, larger units of 
verbal materials, or other kinds of materials 
altogether could not be subjected to the same 
logic and the same experimenta] treatment. 
For instance, the copy cues (cues nominally 
identical with target items) used in a recog- 
nition test or the “invisible” cues assumed 
to guide retrieval in free recall could be 
treated in the same way as the A and R cues 
in our demonstration experiment, Second 


since we define the trace in terms of 


" 4 d query— 
Output relations, its description need not 
emerge in terms of selected features, or as 


an impoverished COpy, of the ir 
"Third, the trace matrix 
sion of the fact that the recall of an event 
varies with retrieval conditions, Hence 
there is no logical paradox between a fixed 
effect of a given experience and its variable 
memorial manifestations, Fourth, the trace 
matrix tells us not only about the 

efficiency of different type: i 
but also about how 


à Put item, 
1$ a forma] expres- 
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another. Finally, as we have shown in the 
demonstration experiment, it is a simple 
matter to find out how encoding conditions 
affect the trace of one and the same item. 
We hold the input items nominally constant, 
vary the conditions under which they are 
encoded, and then use the reduction method 
to arrive at the description of the trace. 
Our theory as presented here is by no 
means complete. Tt will be revised and 
modified as we learn more about it and as 
we meet new problems in exploring its util- 
ity. In this final section of the article, we 


will consider some of the more obvious crit- 
icisms of the theory, 


Trace Structure or Cue Effectiveness? 


i One of the questions that needs to be con- 
sidered concerns the identification of various 
kinds of retrieval cue valences with 


matrix with trace structure, 
refer to a pattern of cue valence 
ing the structure of the memor 
than simply discuss the data 
are,” namely, 
ness? 


We invoke the co 


Why do we 
es as describ- 
y trace, rather 
as they “really 
as measures of cue effectiye- 


A ncept of the memory 
race because it helps to integrate, organize, 


and make sense out of what otherwise would 
be an unconnected list of experimental pro- 
cedures and observations, The output from 
the memory system depends on previous in- 
Puts: The system can produce a certain out- 
Put after but not before a certain input. 
This means that the system after the input 
must be somehow different from the system 
before the input. We can, like many others 
before us, think of this difference as the 
memory trace, and this primitive idea helps 
make sense of the changes in the output 
from the system. But if we wish to go be- 
yond the mere assertion that an input into 
the system creates a new memory trace, we 
have to be more precise about how the trace 
is to be conceptualized, defined, and de- 
scribed, The theory outlined here attempts 
to do just this. Tt Provides (a) an inter- 
nally consistent set of statements about what 
the memory trace is and what its properties 
are, and (b) a set of objectively specified 


STRUCTURE OF 


rules by which experimental observations 
are translated into theoretical terms. 

The theory neither demands nor assumes 
any kind of “real” existence of some "thing" 
stored in the memory system. Memory 
trace, as defined by the theory and as de- 
scribed by the reduction method, represents 
a hypothetical construct that serves to orga- 
nize a potentially "large list of relations be- 
tween different questions directed at the 
system and the output from the system" 
(Tulving & Bower, 1974, p. 294). When 
these questions take the form of specific re- 
trieval cues, the trace becomes the referent 
of the cues. We argued earlier that it makes 
little sense to talk about the effectiveness of 
a particular cue or the relation between two 
particular cues. Whenever we state the va- 
lence of a cue it is necessary to stipulate its 
referent, and for us the referent is the trace 
ofa certain event. Thus we talk of valence 
of cue X with respect to trace TS 

The concept of trace structure, and its 
description in terms of two or more cues 
and their informational overlap, represents 
a straightforward elaboration of these ideas. 
The measurement and specification of just 
the gross valences of individual cues pro- 
vides only partial information about the 
changes that have taken place in the system 
as a consequence of a particular input. 
Probing the memory store with two cues 
aimed at one and the same target event, and 
observing the gross, common, and reduced 
valences of these cues, provides a richer 
picture of these changes. The similarity or 
relatedness of any two cues, X and Y, thus 
need not be determined by experimenters’ 
intuitions; they can be measured, with re- 
spect to a specified trace T, by the reduction 
method. The magnitude of the common va- 
lence of two cues in the trace matrix, rela- 
tive to the reduced valences, provides an 
objective measure of the extent to which the 


two cues contain the same retrieval infor- 


mation. 

It is for these reasons, then, that we prefer 
s the memory trace and its struc- 
than simply describe the ob- 
en inputs into and out- 


to discus 


ture, rather 
Jations betwe 


served re 
he memory system. 


puts from t 
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Memory Trace and Memory Organization 


Memory trace as defined and described by 
our theory always represents input events of 
a particular class. For instance, when input 
events consist of the presentation of individ- 
ual words to many people, the resulting 
trace structure will refer to the population 
from which the subject-words were sampled, 
under specified conditions of presentation 
and recall. It would be a fairly straightfor- 
ward matter to use just one item and many 
subjects, or many jtems and just one subject, 
and to describe the trace of classes of items 
defined in this manner. But it is not pos- 
sible to specify the structure of the trace of 
a particular event in some particular subject, 
just as it is not possible to specify the state 
of one particular atomic nucleus at some 
particular instant. 

Another issue concerns the fact that traces 
may be stored in an organized fashion (e.g. 
Mandler, 1967). A critic might wonder 
whether available evidence for organization 
in memory invalidates our method, which 
entails probing for individual items. There 
is, in fact, no conflict here. The organiza- 
tion of items in memory will be reflected in 
the trace matrices of these items. A corol- 
lary of this fact is that our method. provides 
an approach to the study of organization 10 
memory. For instance, our demonstration 
experiment showed different trace structures 
according to whether the target items had 
been presented in the context of associative 
or rhyming words (Tables 5 and 6). Fur- 
ther, our method readily lends itself to the 
investigation of the effects of intralist con- 
text effects on trace structure in a free- 
recall situation. Thus the organization of a 
given item, or set of items, can be manip- 
ulated—for instance, by adding conceptually 
related items to the list—and the consequent 
changes in traces described by the reduction 
method. 


Cue Effects Through Mediation? 

We have already noted that our theory 
was developed in the context of the encoding 
specificity principle: A cue is effective to the 
extent that its informational contents mate 
the informational contents of the representa- 
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tion of the experienced episode. It is this 
principle that permits the specification of 
trace elements in terms of retrieval cues, 
But the assumption of a direct match be- 
tween cue and whatever it is that has been 
encoded from an earlier event may be ques- 
tioned. 

It could be argued that the effect of a 
given cue could be mediated through some 
other cue, and therefore the valences of cues 
of a designated class i 


lustrated with our standard example. As- 
sume that the cue 


memory a number of rhyming words, includ- 
ing the word "chair," 
retrieval of “chair,” its memory location could 
have been examined for evi 


formation out implicitly 


formati > im retrieved 
chair" and matched it with corre: 


Sponding 


strongly suggest that, at least in thi 
ment, the degree of mediation wa 


É S at most 
rather small. If a rhyming cue 


alwa ys ef- 


information 
about the target item by first eliciting a word 
with semantic contents matching those of the 
target word, as illustrated in our example 
above, then the valence of the rhyming cue 
with respect to the trace reduced by the se- 
mantic (associative) cue should be zero. In 
fact, the proportions of AR and AR ele- 
ments in the trace matrices were quite large. 
A simple mediation hypothesis also founders 
on the forgetting data shown in Table 6, 
since it could not explain why the common 
valence of the A and R cues (AR) dras- 
tically decreased over time, while the re- 
duced valences (AR and AR) either stayed 
the same or even increased, 


In fact, the 
rieval mech- 


u e » Would find rela- 
tively few Constraints in the theory of trace 


Structure that has been outlined here, 
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; i iv i learning about a stimulus de- 
ding to theories of selective attention, ) 
a on attending to that stimulus; this is represented in two-stage models 


by saying that subjects switch in analyzers as well as lear 
This assumption, , 
in a formal model by the incorporation of a stimulus- 
parameter, œ, into the equations describing change: 
Theories of selective attention 


response associations. 


strength of stimuli. 
(a) that subjects learn to attend to 


ning stimulus- 
however, is equally well represented 
specific learning-rate 
S in the associative 

have also assumed 
and ignore relevant and irrelevant 


stimuli (ie., that œ may increase or decrease depending on the correlation 
im 


of a stimulus with reinforcement) and 


ship between the probabilities of attending to di 
an increase in o to one stimulus is accompanied by 
The first assumption is used to explain the ph 


tinctiveness and dimensional transfer, 
and blocking. Although the first assu 


second is not: Overshadowing and blocking are 
choice of an appropriate rule for changing c, such that œ 


that signal no change from the proba! 
other stimuli. 


In experiments on classical conditioning, 
the arrangement of a contingency between 
a stimulus and a reinforcer typically es- 
tablishes that stimulus as an effective 
conditioned stimulus (CS), which reliably 
elicits a conditioned response (CR). In 
experiments on instrumental learning, the 
reinforcement of a particular response only 
in the presence of a specific stimulus typi- 
cally establishes that stimulus as an ef- 
fective discriminative stimulus, which re- 
liably controls the occurrence of that 
EE 


The research reported in this article was sup- 
ported by grants from the National Research 
Council of Canada and by the United Kingdom 
Science Research Council. I am indebted to 
numerous people who have read earlier versions 
of the paper or patiently listened to me expound 
some of the ideas it contains. Although | have 
no doubt assimilated ideas from many of them, 
lam particularly conscious of having incorporated 
points suggested by A, Baker, A, Dickinson, 
G. Hall, E. Macphail, D. Medin, R. A, Rescorla, 
and N. S. Sutherland. I am very grateful to 
them all. a ; 

Inquiries concerning this article should be sent 
to N. J. Mackintosh, Laboratory of Experimenta] 
Psychology, University of Sussex, Brighton BN1 
9QG, England. 


276 


(b) that there is an inverse relation- 
fferent stimuli (i.e., that 
a decrease in æ to others). 
enomena of acquired dis- 
the second those of overshadowing 
mption is justified by the data, the 
better explained by the 
decreases to stimuli 
bility of reinforcement predicted by 


response. Whatever 
be between these tw 
unreasonable to exp 
the processes where 


differences there may 
9 procedures, it is not 
ect some similarity in 
by the CS or discrimi- 


native stimulus are established as effective 
signals. The two stimuli may signal dif- 
ferent events. 


r the simple delivery of a 
reinforcer in one case, and the delivery 
of a reinforcer contingent on the occur- 
rence of a response in the other—but one 
may hope that similar laws govern the 
acquisition of these signaling properties. 
My concern in this article, at any rate, 
1$ with the general question of how stimuli 
are established as signals or acquire asso- 
Clative Strength, and no distinction will be 
made between classical and instrumental 
experiments, 

Virtually all theories of associative learn- 
ing have taken as their st; 
assumption th 
ciative strengtl 


tween occurre 
the event it signals. 


ters more precisely, 
that the magnitude 


D 
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of increments to the associative strength 
of a particular stimulus, A, is some function 
of the current value of A’s associative 
strength, the simplest version of this as- 
sumption being that such increments are 
a linearly decreasing function of that value. 
This assumption is expressed formally by 
the application of linear operators to 
changes in associative strength: 


AVa = 00 — Va), (1) 


where Va = current associative strength 
of A; @ is a learning-rate parameter, the 
value of which is determined by the rein- 
forcer used within the limits Qe 0-« 1s 
and ^ = limit of V, also dependent on the 
reinforcer used. 

This may be a starting point, but it is 
clearly no more than that. For example, 
it does not provide any account of one of 
the most obvious features of both classical 
and instrumental conditioning, namely, 
the observation that increments in asso- 
ciative strength depend on the nature of 
the stimulus. Pavlov (1927) noted that 
a more intense stimulus was a more ef- 
fective CS than a less intense one, and 
that in the case of salivary conditioning 
with dogs, stimuli from some modalities, 
such as tactile and auditory stimuli, were 
more effective CSs than stimuli from other 
modalities, such as visual or thermal 
stimuli. Similarly, the wavelength of light 
to which pigeons are exposed is more likely 
to gain control over their food-reinforced 
instrumental responses than is the fre- 
quency of a tone sounded from a speaker 
(Foree & LoLordo, 1973). The simplest 
way of representing these and similar ob- 
servations is to assume that the rate of 
conditioning to 2 particular stimulus is 
not just determined by some general 
learning-rate parameter, such as 6, but is 
also affected by à stimulus-specific param- 
eter, which may be referred to as e. Thus, 


AVa = aa80 — Va) (2) 


where 0 <@ e. JE is now possible m 
represent differences 1n the rate of condi- 
tioning to stimuli differing 1n intensity Or 

ences in the value of @ 


salience as differ i 
associated with that stimulus. 1f condi- 


tioning proceeds faster with stimulus A 
than with stimulus B, or if subjects, given 
conditioning trials with a compound stim- 
ulus consisting of A + B, respond strongly 
to A but weakly to B, these facts may be 
represented by assigning a high value to 
aa, and a low value to an. 

To this point, the elementary ideas being 
advanced do not represent any significant 
departure from the central assumptions 
of traditional theories of learning, at least 
of theories such as those advanced by Hull 
(1943), Spence (1936, 1956), or Estes (1950, 
1959). At least two recently proposed 
models in this tradition, for example, have 
included a stimulus-specific learning-rate 
parameter such as @ (Kendler, 1971; 
Rescorla & Wagner, 1972). 


THEORIES OF SELECTIVE ATTENTION 


The framework expressed by Equa- 
tion 2 is, in fact, sufficiently general to 
include another group of theories, super- 
ficially quite distinct. Theories of selec- 
tive attention, of which several versions 
have been advanced in recent years (for 
example, Lovejoy, 1968; Sutherland, 1964; 
Sutherland & Mackintosh, 1971; Trabasso 
& Bower, 1968; Zeaman & House, 1963), 
appear to incorporate some quite radically 
new assumptions about the processes 1N- 
volved in even simple cases of instru- 
mental learning. Nevertheless, I shall 
argue that their two central assumptions 
can easily be represented as variants on 
Equation 2. 

Theories of selective attention have all 
been formalized as “two-stage’’ Or “chain- 
ing" models, where the subject is envisaged 
as first observing or attending to a set of 
stimuli on each trial, and then making an 
overt response determined only by those 
stimuli attended to on that trial. In this 
form, they seem to differ sharply from the 
“single-stage” type of model characterized 
by Equation 2. Yet although the func- 
tion of the observing oF attentional stage 
may be described informally as that © 
initially exposing the subject to a parti- 
cular set of discriminative stimuli so that 
those stimuli and no others are available 
for association with the outcome of the 
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trial, formally it is only this second point— 
whether or not a particular stimulus has 
its associative strength changed—that is 
of any substance.! 

This point has been represented in two 
different ways. In theories such as those 
of Zeaman and House (1963) and Lovejoy 
(1968), the subject is assumed to attend 
to only one set of stimuli on a trial. The 
probability of attending to a particular 
stimulus on a particular trial, therefore, 
determines the probability that the out- 
come of that trial will change the asso- 
ciative strength of that stimulus. In the 
theory of Sutherland and Mackintosh 
(1971), on the other hand, subjects are 
assumed to switch in several analyzers on 
a given trial. The strength of a particular 
analyzer, therefore, determines the amount 
by which the associative strength of a 
particular stimulus is changed. Over a 
series of trials, of course, these two for- 
malizations amount to very much the same 
thing. More important, however, insofar 
as the strength of switching in a particular 
analyzer or the probability of attending 
to a particular set of stimuli simply de- 
termines which stimuli change their asso- 
ciative strengths (and by how much), this 
characteristic of two-stage models is equally 
well represented by a single-stage model 
that incorporates a learn 
ter, o, as exemplified 
Although theories of sel 
have, indeed, incorporated 
tions, these assumptions 


terminants of the value of a, 

The first of these assumptions, common 
to all theories of selective attention, is 
that æ is not a fixed consequence of such 
physical characteristics of a stimulus ag 
its intensity or modality, but rather that 
« may vary with the subject's experience, 
The experience in question may have oc. 
curred before the start of an experiment, 


1 This is of course an oversimplification, for the 
direction of the subject's attention is also assumed 
to affect which stimuli will actually determine 
performance at any given moment, but the ques- 
ion of performance may for the moment be 
leferred. 


but in the more interesting case, it is a 
consequence of the correlation between the 
stimuli and the reinforcement to which 
the subject is exposed during the course 
of an experiment. In the language nor- 
mally used by such theorists, the proba- 
bility of attending to relevant stimuli 
typically increases, while the probability 
of attending to irrelevant stimuli typically 
decreases. In other words, if variations 
in stimulus A are correlated with changes 
in reinforcement, &a may increase, while 
if variations in Stimulus B are not cor- 


related with changes in reinforcement, 
«n may decrease. 


This is a necessary 
consequence of the “one-look”? assumption 
incorporated into the models of Zeaman 
and House (1963) and Lovejoy (1968). 
Since Subjects are assumed to attend to 
only one set of stimuli on a trial, it neces- 
sarily follows that as the Probability of 
attending to one set increases, So the 
Probability of attending to others must 
decrease, 


Mackintosh (1971), although allowing that 
learning can available 
stimuli on à single trial, assume that the 
Strengths of al] analyzers sum to 1.0. In 
theories of selective at- 
aid to have assumed an 
lip between the values 
he population of stimuli 
Siven experimental situ- 
hat situation can be concep- 


Ssumptions (that 
j n being fixed may 
: that the 
r different stimuli are in- 
ated ry other) have both 
: ; 6 Into essentially all theo- 
ries of selective attention, asd enc 


* 
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both have also been rejected by the 
majority of other theories of associative 
learning, it is important to see that they 
are logically independent. There is, in 
fact, no reason why the possibility that 
subjects might learn to attend to relevant 
stimuli or ignore irrelevant stimuli should 
carry with it the implication that such 
attention is selective in the sense required 
by the inverse hypothesis. One can con- 
ceive of a theory of associative learning, 
which while permitting œ for various stim- 
uli to change in accordance with the 
schedule of reinforcement associated with 
those stimuli, does not insist that a change 
in a to one set of stimuli is necessarily 
accompanied by an equal and opposite 
change in @ to other sets of stimuli. Since 
a theory that accepts one of the assump- 
tions common to most theories of attention 
without the other is thus clearly a logical 
possibility, it is important that the pre- 
suppositions and experimental evidence 
relevant to the two assumptions be exam- 
ined independently. 


Changes in Attention with Experience 


Traces of the assumption that the direc- 
tion of attention may change can be found 
jn both Lashley’s (1929) and Krechevsky’s 
(1932) early writings on discrimination 
learning. According to Krechevsky, sub- 
jects entered the experimental situation 
with a particular set of hypotheses about 
which stimuli might be relevant and which 
but in the face of inconsistent 
reinforcement would discard unsuccessful 
hypotheses and try, out new ones. Al- 
though hypotheses were thus said to be 
“docile,” the actual rules governing changes 


from one hypothesis to another were not 
specified, and the implication was that they 
were better characterized as consequences 
of a sudden new insight, or of the re- 
organization of the subject's perceptual 
field, rather than being due to any readily 
understood process of associative gir} 
In this context, it was Lawrence (1949, 


he ‘‘ac- 
5 tho first suggested that th 
n “dist f cues" might be 


1 to 
ái " amenable 
a gradual le 


analysis in 


irrelevant, 
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It was also Lawrence, of course, who 
first provided relatively convincing evi- 
dence that changes in a might occur. His 
experiments on simultaneous and succes- 
sive discriminations (Lawrence, 1949, 1950) 
and on transfer along a continuum (Law- 
rence, 1952) showed transfer from one 
discrimination problem to another, which 
seemed to imply increases in the value 
of a for relevant stimuli and/or decreases 
in the value of a for irrelevant stimuli. 
This is not the place for a detailed review 
of these studies, but although other ac- 
counts of these transfer effects have been 
offered, there is reason to question their 
sufficiency (Sutherland & Mackintosh, 
1971). Similarly, although experiments 
on discrimination reversal undoubtedly in- 
volve a number of complex processes, it 
can be argued that one reason why over- 
training sometimes facilitates reversal 
learning, and why exposure to a series of 
reversals of a single discrimination some- 
times leads to eventually very rapid 
reversal, is that both overtraining and 
serial reversal training increase the value 
of a for the relevant discriminative stimuli 
(Mackintosh, 1969; Mackintosh & Hol- 
gate, 1969). Finally, studies of intra- 
dimensional and extradimensional shifts 
have shown that the magnitude of transfer 
from one discrimination problem to another 
depends on whether the stimulus dimen- 
sions that were relevant and irrelevant in 
the first problem retain their original sig- 
nificance in the second problem (Shepp 
& Eimas, 1964; Shepp & Schrier, 1969). 
It is very difficult to see how such à 
finding can be explained without assuming 
changes in a for relevant and/or irrelevant 
stimuli, which transfer to other stimuli 
falling along the same dimension. 

It may be accepted that many of these 
findings imply that «æ changes during the 
course of discrimination learning; they do 
not, however, enable one to specify the 
direction of such changes. As was im- 
plicitly acknowledged in the preceding 
paragraph, many of these results could as 
well be a consequence of a decrease in a 
for irrelevant stimuli, as of an increase 


in a for relevant stimuli. It is somewhat 
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surprising, therefore, to find that formal 
theories of selective attention have con- 
centrated on this second interpretation to 
the virtual exclusion of the first. It is 
true that Lawrence (1949) admitted that 
a decline in the distinctiveness of irrelevant 
stimuli was probably at least as important 
as an increase in the distinctiveness of 
relevant stimuli in generating the transfer 
effects he observed. But in the formal 
two-stage models of discrimination learn- 
ing proposed by Zeaman and House (1963), 
Lovejoy (1968), and Sutherland and Mac- 
kintosh (1971), while differential reinforce- 
ment is assumed to increase attention to 
relevant Stimuli, there is no comparable 
independent mechanism for reducing at- 
tention to irrelevant stimuli. Except in 
the sense that a formerly relevant analyzer 
Will revert to its baseline strength if the 
Stimuli it detects subsequently become 
irrelevant, the only mechanism for de- 
creasing the strength of irrelevant analyzers 
is by competition with relevant analyzers, 
It is, in fact, only the inverse hypothesis 
that ensures that as attention to relevant 
stimuli increases, so attention to irrelevant 
Stimuli must decrease. 

There are 
analysis. A 


Sequence of any incr a 
another set of stimuli, 


hich 


nonreinforced Preexposure to a Particular 


stimulus interferes with subs 
ditioning to that stimulus, 

by Rescorla (1971) and Reiss 
(1972) have established tha 
reinforced Preexposure does 
stimulus into a conditioned inhibitor o 
signal for nonreinforcement and indeed 
may significantly interfere with the estab. 
lishment of subsequent inhibitory condi 
tioning. It is difficult, therefore, to Bs 
how such results can be explained except 
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by postulating changes in a stimulus- 
specific learning-rate parameter such as a. 
Nor is it necessary to present a stimulus 
in the complete absence of reinforcement, 


as in typical experiments on latent in- 
hibition, in Order to retard subsequent 
conditioning. Indeed, sufficient exposure 


to uncorrelated presentations of a stimulus 
and reinforcer may be an even more ef- 
fective Procedure for reducing the asso- 
ciability of that stimulus with that rein- 
forcer (Mackintosh, 1973). 

, Latent inhibition | 
in experiments on free-operant instrumental 
discrimination learning by Mellgren and 


Ost (1969) and Halgren (1974). Although 
à similar finding | 


discrete-tria] studi 
what would seem 


effect, Tp experiments on simultaneous 
and Successive discriminat; learning 
, 


and Hall (Note 1) obtained 


&nore irrelevant 
l to trial without 
forcement, 

à considerable amount 
ent in a general way 
- assumption of theories of 
Ssociability of 4 stimu- 
nt is not an immutable 
at stimulus's 


Gi ACE physical 
atime istics. Exposure to a set of 
: 1 Correlated with changes in rein- 
Orcement 


elabi (pear to increase the asso- 
Y oi those and similar stimuli with 


Uture changes in reinforcement, Con- 
Variations in a set of 
1 the absence of correlated changes 
appears to decrease the 
lose stimuli with future 


a The fact that 
< Ses In œ occur, of COurse, doe 
e l : i rse, does 
t 2 Itself specify the nature of the rules 
oa anging & that a formal model would 
‘O Incorporate in order i 
. to provide 
a Satisfactory account of t p But 


that 


: decreases simply by 
Mcrease in attention to 
It seems probable that 


x 
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a symmetrical set of rules will be required 
that will permit opposite but independent 
changes in a to relevant and irrelevant 
stimuli. 


The Inverse Hypothesis 


The second main assumption held by 
most theories of selective attention is that 
stimuli compete for attention, so that an 
increase in attention to one set of stimuli 
will necessarily involve a decrease in at- 
tention to some other set of stimuli. It 
should be clear that the experiments re- 
viewed in the preceding section, designed 
to show that the probability of attending 
to a particular set of stimuli may change 
with experience, do not have any bearing 
on the validity of this second assumption. 
What then has been the justification for 
the inverse hypothesis? 

One line of argument has been that it is 
a necessary corollary of the assumption, 
derivable from information theory, that 
there are limits to any organism's capacity 
to process information. Stimuli must be 
assumed to compete for access to a chan- 
nel of limited capacity, and the formation 
of associations between one set of stimuli 
and reinforcement will necessarily be at 
the expense of the formation of associa- 
tions with other stimuli. Plausible as it 
may seem to insist on limitations on the 
capacity to process information, however, 
it may be questioned whether these con- 
cepts are of any substantial relevance to 
the sorts of situations to which theories 
of selective attention have been applied. 
Of the numerous differences between ex- 
periments on instrumental discrimination 
learning in animals, and those studies of 


dichotic listening or reaction times in 
human subjects on which the concept of 
a limitation on capacity has been based, 
perhaps the most important 1$ that in the 
former situation subjects control their own 
exposure to the discriminative stimuli; 
they are under no constraints to respond 
rapidly and there is little reason to Sup- 
y 

pose therefore; that they are processing 
info ‘mation at rates close to the limits 
rs v. Although it is true that 
of their capacity i HE Rn 

i classical conditioning 
in experiments on cla 


the experimenter controls the subject's 
exposure to the CS, the stimuli used are 
typically salient, often unlocalized, and 
may sometimes last up to several minutes. 
It stretches credulity to suppose that the 
rat's channel capacity is too limited to 
permit processing of both a loud tone and 
a bright light, when the two are switched 
on simultaneously for the duration of a 
3-minute trial in an experiment on con- 
ditioned suppression. Whether or not strict 
simultaneity of stimulus processing is im- 
possible, typical animal experiments have 
imposed no such requirement. 

It is obvious, however, that the major 
justification for a particular theoretical 
assumption is the extent to which it per- 
mits the derivation of experimental data. 
What are the data that are typically 
explained by the inverse hypothesis? They 
may generally be described as instances of 
“stimulus selection," data which suggest 
that the association of one stimulus with 
reinforcement does not depend simply on 
the characteristics of that stimulus, on its 
correlation with reinforcement, or even, 
apparently, on its past history, but may 
be affected by the nature of the other 
stimuli present at the same time, by their 
correlation with reinforcement, and by 
their past history. The most important 
instances of such data are provided by 
studies of overshadowing and blocking. 
The term overshadowing refers to the find- 
ing, first reported by Pavlov (1927), that 
the presence of an equally relevant, more 
salient stimulus may decrease or com- 
pletely prevent conditioning to a less 
salient stimulus. A dog would associate 
a weak thermal stimulus with the delivery 
of food, provided it was presented on its 
own, but if it was presented only in con- 
junction with a more intense auditory 
stimulus, no conditioning would accrue to 
the weaker stimulus. 1n general, condi- 
tioning to B often proceeds more slowly 
if B is always presented in conjunction 
with A than if B is presented alone. 

The term blocking was used by Kamin 
(1969) to refer to the finding that prior 
training on one element, A, of a com- 
pound stimulus, AB, might completely 
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prevent conditioning to the second ele- 
ment, B. If rats received a series of trials 
on which a compounds CS, consisting of 
a light and a noise, signaled the occur- 
rence of a shock, they would show sig- 
nificant conditioned suppression to both 
stimuli. But if these compound trials 
were preceded by a series of trials on 
which, say, the light alone signaled shock, 
they would show little or no suppression 
to the noise. 

Both overshadowing and blocking, it 
need hardly be pointed out, follow quite 
directly from the inverse hypothesis. An 
intense stimulus is precisely one that is 
associated with a high value of a, and its 
addition to the stimulus compound cor- 
related with reinforcement will necessarily, 
according to the inverse hypothesis, de- 
crease the value of œ associated with any 
other stimulus. Similarly, prior training 
with A relevant will increase «4 and hence 
ensure that when B is added, ag will be 
low. In spite of this apparently happy 
agreement between data and theory, how- 
ever, one can still ask whether blocking 
and overshadowing are explained more 
satisfactorily by a theory of selective at- 
tention than by any other theory, and 
whether they are observed on the occa- 
sions required, and for the Teasons speci- 
fied, by such a theory. There is reason 
to believe that they are not. 

Rescorla and Wagner (1972) proposed 

` an alternative explanation of these 


i a and 
other instances of stimulus selection. Ka- 
min (1969) suggested that blocking occurs 


because on compound trials the subject 
already expects the reinforcer and only 
unexpected reinforcers are reinforcing 
Rescorla and Wagner have formalized one 
aspect of this idea: The centra] 
tion of their model is that cha 
associative strength of a stimulus are in. 
versely related, not just to th 
associative Strength of that stin 


to the current strength of the en 
lus complex of which 


assu mp- 
nges in the 


€ current 
nulus, but 
f tire stimu- 
it forms a 


Part. 
Formally, 

AVa = 0(\ — y), (3) 
where V is the sum of the associative 
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strengths of all stimuli (including A) 
present at the moment of reinforcement. 
In an experiment on blocking, a sufficient 
number of reinforced trials with A alone 
will ensure that Va approaches 2; at this 
point, reinforced trials with AB will result 
in little or no conditioning to B, since 
(à — Y) will be nearly zero. 

Rescorla and Wagner's model, although 
clearly quite distinct from any theory of 
selective attention, may still, without gross 


distortion, be said to explain stimulus 
selection by appeal to an inverse hy- 
pothesis, 


: Stimuli are assumed to compete 
with one another, not for a limited atten- 
tional capacity but for a limited amount 
of associative strength conditionable by a 
given reinforcer. Other things being equal, 
the greater the associative strength ac- 
cruing to one element of a compound CS, 
the less will be the associative strength of 
the other. Unlike theories of selective 
attention, however, Rescorla and Wag- 
ner’s theory does not postulate what may 
seem to some implausible limits to an 
animal's capacity to attend to incoming 
stimuli; Moreover, as we shall see, there 
are cases where its Predictions differ from 
those derivable from theories of selective 
sa age and cases where it is supported 
fre evidence. I shall argue, however, 

lat in the final analysis, the data on 
stimulus selection do not support any 
version of the inverse hypothesis : Blocking 
and overshadowing are not the conse- 
ae o: any simple competition between 

9r some limited resource, 
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available d 
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à signal for rein- 
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is not satisfied, blocking may not occur. 
In Kamin's (1969) experiments on blocking 
in conditioned suppression, if A is estab- 
lished as a signal for a 1 mA shock, and 
AB also signals a 1 mA shock, B acquires 
relatively little associative strength. But 
if AB signals a stronger shock than that 
signaled by A alone, or no shock at all, 
then significant excitatory or inhibitory 
conditioning accrues to B. In the latter 
case, Wagner (1971) has shown that if 
reinforced trials with A alone are followed 
by nonreinforced trials with AB, then an 
increase in the number of reinforced trials 
to A, so far from decreasing œg and the 
amount learned about B, actually ensures 
that B is established as an even stronger 
conditioned inhibitor. 

While it might still be possible in the 
face of this evidence to save the analysis 
of blocking suggested by theories of se- 
lective attention, for example by assuming 
that the values of œ change when the 
conditions of reinforcement change, it cer- 
tainly seems more appropriate to assume 
that any adequate explanation of blocking 
must take as its starting point the cardinal 
observation that blocking depends on the 
redundancy of the added element: Block- 
ing occurs because AB signals the same 
reinforcer as that signaled by A alone. 
This is, of course, precisely the position 
adopted by Kamin (1969) and Rescorla 
and Wagner (1972). But there is another 
way of explaining blocking in terms of 
redundancy, one which appeals neither to 
limitations of attentional capacity nor to 
limitations on the total associative strength 
conditionable by particular reinforcers. 
It is possible that the redundancy of B 
reduces conditioning to B because it 
reduces an. 

I have already argued that any adequate 
theory of associative learning must allow 
a to change and that this change must 
include à rule for decreasing « to stimuli 
orrelated with changes in reinforce- 
If it were assumed that «4 in- 
henever A signals a change from 
the prevailing or expected conditions of 
reinforcement, the symmetrical rule for 
decreasing attention to irrelevant stimuli 


unc 
ment. 
creases W 
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would say that ag decreases whenever B 
signals no change from the prevailing or 
expected rate of reinforcement. The ex- 
pected rate of reinforcement, however, is 
precisely that which is predicted by other 
stimuli in the experimental situation. 
Thus the blocking experiment, in which 
AB signals no change in reinforcement 
from that predicted by A alone, is pre- 
cisely the sort of situation that would 
result in a rapid decline in a. 

It is, in fact, possible to show that such 
a decline in æg is one of the things that 
happens when AB signals the same rein- 
forcement previously signaled by A alone. 
It will be recalled that if animals have 
learned that A signals a 1 mA shock, the 
presentation of AB, correlated either with 
a significant increase in the intensity of 
shock or with no shock at all, results in 
significant excitatory or inhibitory condi- 
tioning to B. Mackintosh and Turner 
(1971) found that a small number of 
trials on which AB signaled the same 
shock as A alone, interposed between 
initial conditioning to A and subsequent 
training with AB, significantly impaired 
this excitatory or inhibitory conditioning. 
The implication is that as decreased be- 
cause when B was first introduced it 
signaled no change in the probability or 
magnitude of reinforcement. 

It is, of course, one thing to say that 
if AB signals the same reinforcement pre- 
viously signaled by A alone, am will de- 
crease. It is another thing to say that 
this is the sole or even a major cause of 
blocking. This assertion has a number of 
further implications. If the blocking of 
conditioning to B depends on subjects 
learning to ignore B, for example, no 
blocking can be expected until they have 
done so. Where blocking is substantial, 
it must be assumed that changes in asso- 
ciative strength are slow relative to 
changes in a. If conditioning proceeded 
more rapidly, one might expect to see 
relatively little blocking. But even under 
normal circumstances, one would expect 
to see some conditioning to B, since it 
must take some minimal number of trials 
(at least one) to reduce a» to a point 
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where further changes in the associative 
strength of B are minimal. 

There is, in fact, some evidence con- 
sistent with this final implication. Kamin 
(1969) found it possible to detect some 
conditioning to B in his standard experi- 
mental situation; more important, his 
data suggested that most if not all of this 
conditioning was a consequence of the 
first reinforced AB trial. This observa- 
tion agrees with some results of an experi- 
ment on blocking in instrumental learning 
by Turner (Note 2). In this experiment, 
although pretraining on A resulted in some 
blocking of learning about B during AB 
trials, blocking was far from complete. 
Since animals receiving 100 AB trials 
learned no more about B than those re- 
ceiving only 10 AB trials, however, the 
implication is that learning about B was 
confined to the first few trials after its 
introduction. 

Although these results are consistent 
with the present argument, they do not 
bear on the strong prediction derivable 
from this analysis, that little or no block- 
ing should occur on at least the first 
reinforced compound trial. A satisfactory 
test of this prediction requires a situation 
in which reliable conditioning occurs to 
the elements of a compound CS after a 
single reinforced trial. The conditioned 
suppression of licking in thirsty rats by 
a CS signaling shock provides one such 
situation, and we have attempted to see 
whether under these circumstances pre- 
training on A does result in significant 
blocking of conditioning to B when only 
one reinforced AB trial is given (Mackin- 
tosh, Note 3). In several experiments, no 
evidence of blocking could þe found. 
These results suggest, therefore, that a 
Pee part of blocking observed in 
other situations may indeed be 4 con- 


Sequence of a rapid decline in à : 
to the added, redundant element pan 


Overshadowing 


The design of experiments on block; 
Z ock 
is such that the added stimulus B Ea 
no change in the level of Tihforcemen 
from that predicted by A alone, ahd (c 


was, of course, precisely this feature that 
made it possible to explain blocking in 
terms of a reduction in a, rather than in ; 
terms of the inverse hypothesis. But ins 
experiments on overshadowing, animals. 
may be trained with AB-signaling rein-1g 
forcement from the outset. If A has NCige 
already been established as a signal fc 
reinforcement, how can its presence de-.. 
tract from conditioning to B during reinzys 
forced compound trials, unless some apP£ein- 
is made to the idea that stimuli comp tical 
with each other for association with ret... 
forcement? If a subject simply receives 
reinforced trials with an AB compound, 
both A and B should be established as 
signals for reinforcement, and without in- 
voking the inverse hypothesis it is hard 
to see why the presence of one should 
cause any decline in attention to the other. 
There are, however, 
stances that would 
sufficiently close par 
of an experiment or 


l- 


at least two circum- 
appear to provide a 
allel to the conditions 


1 n blocking to make it 
possible to apply the principle of learned 


irrelevance to the case of overshadowing. 
If A is better correlated with reinforcement 
than B, or if A is a much more salient 
stimulus than B and therefore conditions 
more rapidly, then there is a sense in . 
which B will be a redundant signal of 
reinforcement, In the former case, A, by 
virtue of its superior schedule of rein- 
forcement, will acquire associative strength 
more rapidly than p. In the latter case, 
even though the schedule of reinforcement 
associated With A and B is identical, since 
conditions more rapidly than B, A will 
be established as a reliable Signal for rein- 
forcement after a number of reinforced 
compound trials, while B will have ac- 
quired little associative strength. At this 
Point, just as in the case where the schedule 
of reinforcement associated with A is better 
than that associated with B, the situation 
will be analogous to that obtaining in 
experiments on blocking: The presence of 
Predicts nothing that is not already 
Predicted by A alone, and «p will decline. 
, Numerous Studies of compound condi- 
tioning have established that of two 
equally salient stimuli, the more valid 
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stimulus, associated with: a better schedule 
of reinforcement, will overshadow the less 
valid (e.g., Egger & Miller, 1963; Wagner, 
11969; Wagner, Logan, Haberlandt, & 
Price, 1968) and that of two equally valid 
timuli, the more salient will overshadow 
he less salient (e.g., Kamin, 1969; Miles 
Jenkins, 1973; Pavlov, 1927). What 
gas not been recognized, however, is that 
hese appear to be the only reliably 
‘tablished instances of overshadowing. 
^hough the more salient element of a 
Jipound CS overshadows the less salient, 
-fere is no evidence that the less salient 
will detract from conditioning to the more 
salient, nor is there much evidence to 
suggest that if two equally salient stimuli 
are associated with the same schedule of 
reinforcement, one will overshadow the 


other. 

If stimuli compete for association with 
reinforcement in the manner suggested by 
the inverse hypothesis, one would expect 
to observe a reciprocal interaction between 
all elements of a compound CS. If ani- 
mals receive a series of reinforced trials 
with AB, then, other things being equal, 
any conditioning to B must always be at 
the expense of conditioning to A, and 
vice versa. The experimental evidence 
does not support this expectation. Pavlov 
(1927) reported ihat overshadowing was 
a matter of a reduction in conditioning to 
the weaker of two elements of a com- 
pound CS as a consequence of the presence 
of the stronger. The unspoken implica- 
tion was that conditioning to the stronger 

Although Pavlov did not 


was unaffected. ‘lox 
present the data to support this inference, 
the conclusion is entirely consistent with 
subsequent experiments. 

Kamin (1969) and Mackintosh (1971), 
in studies of conditioned suppression, Te- 
ported that a bright light would over- 
shadow auditory stimuli of varying inten- 


sities, but neither found any evidence to 
the presence of the auditory 


stimulus detracted from conditioning to 
the light, nor that the amount of condi- 
tioning to the light was an inverse func- 
tion of the intensity of the auditory 
stimulus. Schnur (1971), using relatively 


suggest that 


285 


ed P sS visual stimuli in 
found no ae P Aedes ee 
c» ho e of overshadowing at all. 
e that in some of th “peri. 

ments ceiling effe tuese Ginen 
É cts may have obscured 

any evidence of overshadowing. In order 
to obtain further evidence on this point 
we have studied overshadowing in conte 
A snppression, using noise and light 

, , arying the intensity of the 
noise between 50 and 85 db. re 20 „N/m? 
To counteract ceiling effects, subjects 
were. tested over a series of trials in ex- 
tinction, to the point where no group 
maintained complete suppression. Al- 
though there was some evidence of re- 
ciprocal overshadowing at intermediate 
intensities of the noise, in the extreme 
cases there was no evidence of any such 
interaction. The most intense noise sig- 
nificantly overshadowed the light, while 
the least intense noise was itself signifi- 
cantly overshadowed by the light. In 
neither instance, however, did the over- 
shadowed element, even though itself ac- 
quiring significant associative strength, 
detract from conditioning to the stronger 
element. 

In experiments on instrumental dis- 
crimination learning, significant overshad- 
owing has been observed, provided that 
the overshadowed cue is either extremely 
difficult to discriminate (Lovejoy & Rus- 
sell, 1967) or both relatively difficult and 
in a different location from the more 
salient, overshadowing cue (Sutherland & 
Andelman, 1967). When neither of these 
conditions is satisfied, overshadowing is 
not observed (Sutherland & Andelman, 
1967; Turner, 1968), and it may be dif- 
ficult to detect any evidence of an inter- 


action between the stimuli (Sutherland & 


Holgate, 1966; Warren, Derdzinski, Hi- 
rayoshi, & Mumma, 1970). The most 


of overshadowing in in- 
strumental discrimination learning is that 
by Miles and Jenkins (1973). Different 
groups of pigeons were trained on a suc- 
cessive discrimination either the 


presence or absence of a tone, a dif- 
ight intensity, or a combination 


ference in li ra com 1 
of both cues serving to distinguish posi- 


extensive study 


with 


286 


tive and negative trials. The discrimi- 
nability of the light cue varied across 
groups. The most striking feature of their 
results is that the presence of the tone 
detracted from control by the light only 
with the least discriminable value of the 
light, but for this group the presence of 
the light did not significantly detract from 
control by the tone. Conversely, the more 
discriminable lights did overshadow the 
tone, but in these groups the presence of 
the tone did not detract from control by 
the light. No group, in other words, 
showed simultaneous overshadowing of 
both light by tone and tone by light. 

It is too early to reach any unequivocal 
conclusion, but the weight of the evidence 
does not appear to provide much support 
for the inverse hypothesis. Overshadowing 
does not appear to be the general, recip- 
rocal affair required by any theory that 
assumes stimuli necessarily compete for a 
limited total amount of associative strength 
or for access to a limited-capacity channel. 
To a first approximation, stimuli that are 
overshadowed appear to be only those 
with lower salience or inferior correlation 
with reinforcement, which results in a 
slower rate of conditioning than that ac- 
cruing to concurrently presented, over- 
shadowing stimuli. Any interaction be- 
tween elements of a compound that is 
confined to this sort of situation may be 
quite satisfactorily explained b 
that postulates reductions in o 
that predict no unexpected 
reinforcement. 


y a theory 
to stimuli 
changes in 
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Stimuli may condition at different rates, 
and at least one cause of these differences 
may be represented, as in Equation 2, by 
differences in a stimulus-specific iertege 
rate parameter, a. The value of a i: 
initially determined by the physical char- 
acteristics of the stimulus and the subject/'s 
sensory apparatus, but it may also change 
with experience. In particular, if stinini- 
lus A is correlated with changes in reih- 
forcement, æa will increase, and if stimulüs 
B is not correlated with changes in rein- 
forcement, æg will decrease. The critical 
departure from traditional theories of se- 
lective attention is that in a situation 
where several stimuli, A, B, C, are pre- 
sented, an increase in æa does not neces- 
sarily or directly cause a decline in «æg 
or «c. Changes in oy and ac are inde- 
pendent of changes in o. They are caused 
solely by the correlation of B and C with 
changes in reinforcement and do not de- 
pend in any direct way on changes in aa. 


Rules for Changing a 


Even if it is assumed that there may be 
changes in attention to particular stimuli 
the assumption that such changes in at- 
tention are entirely independent at first 
Sight seems to rule out the possibility of 
explaining the types of interaction between 
stimuli that are exemplified by the phe- 
nomena of blocking and overshadowing. 
This is the conclusion accepted by Fisher 
and Zeaman (1973), who have recently 
Proposed a modification of the original 

aman and House (1963) model, one of 
the new features of which is 
of the inverse hypothesis, 
that their revised 
dict blocking or 
Normal conditions, 
the preceding section 


the validity of this 
the 


à rejection 
They noted 
theory would not pre- 
overshadowing under 
As the arguments of 
s perhaps have shown, 
conclusion depends on 
nature of the rules proposed for 
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similarly, disconfirmation of an expecta- 
tion results in a decrease in the strength 
of that observing response. Rules such 
as these will not, of course, predict any 
decline in the strength of an observing 
response to the redundant stimulus in an 
experiment on blocking. If a subject re- 
ceives a series of reinforcements signaled 
by AB, the fact that these trials were 
preceded by a series of reinforced trials 
signaled by A alone will have no effect 
on any changes in the probability of 
attending to B. The rules for changing 
observing responses make no reference to 
whether a particular reinforcer is or is not 
expected on the basis of other simulta- 
neously presented stimuli. 

A theory such as Fisher and Zeaman’s, 
therefore, in which observing responses 
change independently and simply in ac- 
cordance with the reinforcement schedule 
associated with each independent set of 
stimuli, while able to explain data on the 
acquired distinctiveness of cues, reversal 
learning, and intradimensional and extra- 
dimensional shifts, will not predict such 
phenomena of stimulus selection as block- 
ing and overshadowing. It is possible, of 
course, that such phenomena are in no 
sense attentional in origin. One might 
argue that the phenomena of acquired 
distinctiveness and stimulus selection could 
(and should) be handled by entirely dif- 
ferent theoretical constructs. An adequate 
theory of associative learning would be one 
that explained acquired distinctiveness by 
allowing @ to change in the manner pro- 
posed by Fisher and Zeaman (1973), and 
that explained stimulus selection in the 
manner proposed by Kamin (1969) and 
Rescorla and Wagner (1972). 

Although a case could be made for such 
a hybrid theory, it must be recognized 
that Rescorla and Wagner's analysis of 
blocking and overshadowing is a special 
se of the inverse hypothesis, since in- 
creases in the associative strength of one 
component of a compound CS are assumed 
to be at the expense of other components. 
As we saw earlier, therefore, the analysis 
is vulnerable to some of the arguments 
that can be advanced against any such 


cai 
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interpretation of stimulus selection. if 
overshadowing is not the general and 
reciprocal phenomenon postulated by the 
inverse hypothesis, and if little or no 
blocking occurs on the first trial on which 
a new component is added, this is evidence 
against Rescorla and Wagner's analysis 
just as much as against the version of the 
inverse hypothesis postulated by theories 
of selective attention. The force of the 
previous argument was precisely that 
overshadowing and blocking are conse- 
quences of reductions in the value of a 
associated with overshadowed and blocked 
stimuli and are therefore to be explained 
by the postulation of appropriate rules for 
changes in a. 

It remains, then, to propose such rules. 
Instead of saying, as do Sutherland and 
Mackintosh (1971) and Fisher and Zeaman 
(1973), that subjects learn to attend to 
and ignore stimuli to the extent that those 
stimuli successfully predict the outcome 
of a trial, we want to say that it depends 
on whether the stimuli are uniquely suc- 
cessful in their predictions. The intuition 
that we require to formalize is that a4 
should increase if A predicts an otherwise 
unexpected reinforcer, while æa should de- 
crease if A signals no change in reinforce- 
ment from the level expected on the basis 
of other events. There are presumably, 
a number of ways in which this might be 
done, but possibly the simplest is as fol- 
lows. The extent to which a reinforcer 
is predicted by A is represented by the 
absolute value of the term |\ — Val, 
where, as usual, À is the asymptotic asso- 
ciative strength conditionable by that re- 
inforcer, and V4 is the current associative 
strength of A. If we wish æa to increase 
whenever the outcome of a trial is pre- 
dicted by A better than by all other 
events on that trial, we could say, 


R= Val <|A — Val, 
(4) 


where Vx is the associative strength of all 
stimuli other than A present on that trial. 
Conversely, if we wish o4 to decrease 
whenever the outcome of trial is predicted 
by other events at least as well as by A, 


Aaa is positive if | 
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we could say, 
Aaa is negative if |A — Va|>|\ — Vx]. 

(5) 
Stimuli can, of course, be established as 
signals for nonreinforcement just as for 
reinforcement. The simple assumption 
that the value of à for nonreinforcement 
is either zero or some negative number 
will permit appropriate changes in a on 
nonreinforced trials.? 

It is easy to see how such rules will 
predict the occurrence of overshadowing 
and blocking under appropriate circum- 
stances; as will decrease if B is always 
presented in conjunction with A and A has 
previously been established as a signal for 
the reinforcer in question, or if, by virtue 
of its greater salience, A is initially asso- 
ciated with a higher value of œ and thus 
acquires associative strength faster than B. 
If it is further assumed that the size of 
the change in a is proportional to the 
discrepancy between |\ — Vs| and 
[A — Vx|, then as will decrease most 
rapidly in the blocking situation, where 
on the first trial on which B is introduced, 
Vs is near zero and Vx (which includes Va) 
is near X. Conversely, œa will increase 
faster if reinforcement is signaled by A 
alone than by a compound CS that in- 
cludes other stimuli as well as A. 
principle, therefore, some reciprocal over- 
shadowing would be predicted, but not if 
the stimuli were sufficiently salient to be 
associated with high values of a at the 
outset of conditioning. The data on the 
effects of salience on overshadowing pres- 
ently available are not sufficient to 
whether such an assum 
it might prove necessar 
different rules in light o 
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one could equally well 
sume that the omission of an expected reinfor, 
resulted in an increment in Some specific inhibitory 
process, which grew to some asymptote, Az, Ed 
that the net associative strength, V, of a stimulus 
was determined by subtracting this inhibitor, 
process from a separate excitatory process which 
Was incremented on reinforced trials, 
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Overshadowing will also be predicted if 
A is a better predictor of reinforcement 
than B, for example if reinforced trials 
signaled by AB are alternated with non- 
reinforced trials signaled by B alone. 
Under these conditions, the different sched- 
ules of reinforcement associated with the 
two stimuli will result in faster condi- 
tioning to A than to B, and the consequent 
reduction in ag will interfere with further 
conditioning to B. Similar arguments 
would apply to the other examples of 
relative validity studied by Wagner (1969). 

There is, however, one serious problem 
with this analysis. There is evidence that 
a less valid stimulus will not only fail to 
gain associative strength but also may 
apparently lose such strength as it had 
acquired during an earlier history of rein- 
forcement. Wagner et al. (1968) showed 
that a light accompanied by a pair of 
tones would acquire significant control 
over responding, but that when the cor- 
relation of the tones with reinforcement 
was increased, the light lost control, even 
though its own correlation with reinforce- 
ment was uncharged. Similarly, if B is 
more salient than A, it may initially ac- 
nr at 3 
? à reement. As train- 
li dine mat peu » the difference in 

ghs the difference in sali- 
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forcement, any decline in control by B 
may reflect the eventual decline in «s 
brought about by the relative change in 
B's validity. It is too early to say whether 
this is a satisfactory solution; as noted 
later, this general case provides the only 
compelling reason for assuming that æ af- 
fects performance as well as learning. 

The present theory states that blocking 
depends on a decline in @ to the added 
element; as such, blocking should never 
be complete and will depend on the rate 
of change in a. The most obvious factor 
determining the degree of blocking will be 
the extent to which the added stimulus 
signals some change in reinforcement. 

1f A alone signals reinforcement and AB 
signals nonreinforcement, then ay will tend 
to increase rather than decrease, since B 
must be a better predictor of nonrein- 
forcement than A. If AB signals a stronger 
reinforcer than that signaled by A alone, 
then ay will tend to decrease, but less 
rapidly than it would if AB signaled the 
same reinforcer as A. This should pre- 
sumably be sufficient to predict the un- 
blocking observed by Kamin (1969) in his 
experiment in varying shock intensity, 
especially when it is remembered that in 
his experiment AB signaled a 4 mA shock, 
and this would have been sufficient to 
produce substantial levels of conditioning 
to B in one or two trials. There is, how- 
ever, reason to believe that unblocking 
may be produced by the occurrence of 
any surprising event shortly after the 
presentation of the compound stimulus. 
Kamin (1969) reported that the delivery 
of a second shock 5 sec after each com- 
pound trial resulted in significant condi- 
tioning to the added element, even though 
this double shock did not appear to act 
as a stronger reinforcer. Gray and Ap- 
pignanesi (1973) observed a similar effect 
when they presented a brief auditory and 
al stimulus shortly after each com- 
If confirmed, such results 
would suggest that unblocking was not 
simply due to an increase Hm A enabling 
the added element to acquire associative 
the manner suggested by 
Unblocking 


visu | 
pound trial. 


strength, in | i 
Rescorla and Wagner (1972). 


289 


may occur whenever the added element 
predicts some event of consequence (such 
as the second shock in Kamin's experi- 
ment), whether or not that event is itself 
able to support conditioning. According 
to the present analysis, one could argue 
that the function of the surprising event 
was to prevent a decline in the value of 
@ associated with the added element, thus 
enabling conditioning between that element 
and the original unconditioned stimulus 
(UCS) to proceed normally. 

The application of the present rules to 
the case of acquired distinctiveness and 
dimensional transfer in instrumental dis- 
crimination learning is relatively simple. 
Relevant stimuli in a discrimination prob- 
lem predict the occurrence and omission 
of reinforcement more accurately than any 
other stimuli, and an increase in their æ 
values will follow from Equation 4. Simi- 
larly, irrelevant stimuli, uncorrelated with 
the delivery of reinforcement, are presented 
in conjunction with predictive relevant 
stimuli and should suffer a decline in 
a value. 

The prediction of the effects of non- 
reinforced preexposure to a CS and of un- 
correlated presentations of a CS and UCS 
requires that such a CS be regarded as 
forming a compound with a set of back- 
ground stimuli, C. The crucial point, then, 
is that the presentation of AC (CS plus 
background) predicts no change from the 
probability of reinforcement or nonrein- 
forcement predicted by C alone, and a de- 
cline in a4 should follow. There is, how- 
ever, some question whether this is an 
ideal account. In the first place, it is only 
the phenomenon of latent inhibition that 
necessitates the otherwise rather unhappy 
assumption that aa declines even. when 
à — Val is equal to |^ — Vx|. For 
reasons of theoretical symmetry, if for 
no others, one would expect this equality 
to produce no change in aa. In the second 
place, there is reason to believe that 
random presentations of a CS and UCS 
retard subsequent conditioning more se- 
than does simple nonreinforced 
this additional 
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to conditioning with that particular UCS 
(Mackintosh, 1973). One interpretation of 
this observation is that o is both stimulus- 
and reinforcer-specific. Changes in the 
associability of a tone and Shock, result- 
ing from exposure to particular correlations 
between the tone and shock, will produce 
only generalized changes in the associ- 
ability of that tone with other reinforcers. 

If further research were to confirm this 
Suggestion, the notion of @ as a stimulus- 
specific learning-rate parameter would need 
replacing with a parameter o4 n, the value 
of which determined the magnitude of 
changes in the associative strength of A 


represented the associability of a particu- 
lar stimulus With a particular reinforcer, 
changes in æ would be produced only by 


Scope of the theory—to be explained, 
perhaps, in terms of some simpler mecha- 
nism of habituation. 

In the absence of further evidence, 
further speculation along these lines would 
be idle. For the Present, the rules sug- 
gested in Equations 4 and 5 may be 
regarded as one way of representing the 
idea that the associability of à Stimulus 
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Starting Value of a: Representation of 


Stimulus Salience 


It is not enough to devise rules for 
changing a. A comprehensive theory must 
Specify the limits within which & may 
change and the conditions determining 
the starting value of a. "The simplest 
solutions to these problems are to allow a, 
asa learning-rate parameter, to vary be- 
tween 0 and 1 and to assume that the 
Starting value of « for any particular 
stimulus is positively correlated with the 
intensity or salience of that stimulus. 
While these are reasonable assumptions, 
they are not in fact sufficient to account 
for the effects of stimulus salience. 

Lovejoy (1968), in his formalization of 
a two-stage theory of selective attention, 
distinguished between the fixed, base-level 
distinctiveness of a cue and its changeable 
or directable distinctiveness, Only the 
latter was subject to modification by 
learning. The implication was that changes 
in attention to various cues due to ex- 
perience are "bounded, second-order effects 
which take Place within definite limits 
that depend on the initial 


Cues" (p, 60). The 


Owever much one might 
1 t attend to brightness cues 
in a jumping Stand, they are unlikely to 
‘neous brightness discrimi- 
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an easy or difficult brightness discrimina- 
tion after extensive overtraining were not 
predicted by such a model. 

It is equally easy to show that such an 
account is inadequate for the data of clas- 
sical conditioning. Although the main 
effect of differences in the intensity of the 
CS is a difference in the rate of condi- 
tioning, Kamin (1965) has shown that a 
very weak CS may result in a lower level 
of conditioning even at asymptote. 
Asymptotic differences in performance, of 
course, cannot be attributed to variations 
in a learning-rate parameter. 

Differences in the discriminability of 
S+ and S— in a discrimination problem 
or between the presence and absence of 
the CS in classical conditioning, therefore, 
seem to have more permanent effects on 
performance than can be encompassed by 
a model that represents such differences 
by assigning different values to a, but 
which then permits o for any stimulus to 
vary between 0 and 1 as a result of ex- 
perience. Instead of introducing a further 
parameter (such as a fixed a") for each 
set of stimuli, to represent their dis- 
criminability from the background or from 
other discriminative stimuli, it would seem 
more consistent with other psychological 
knowledge to rely on the notion of gen- 
eralization. Perkins (1953) and Logan 
(1954), for example, argued that the well- 
authenticated observation that condition- 
ing proceeds more rapidly with an intense 
CS than with one less intense may be a 
consequence of differences in generalization 
between the CS and the background. The 
more intense a CS, the greater the dif- 
ference between that CS and its absence 
(the background). Reinforcement in the 
presence of an intense CS and nonrein- 
forcement in its absence, therefore, will 
result in less generalization of excitation 
to the background and less generalization 
of inhibition from the background to the 
CS than will a comparable amount of 
differential reinforcement correlated with 
the presence and absence of a less in- 
tense CS. By the same line of argument, 
the greater the difference between S+ 
and. S= in à discrimination problem, the 
greater the generalization of excitation to 


S— that will result from each reinforce- 
ment of S+, and the greater the general- 
ization of inhibition to S+ that will result 
from each nonreinforcement of S— (cf. 
Kendler, 1971). 

These ideas can be formally represented 
in a relatively simple way by introducing 
a parameter, .S;,;, representing the simi- 
larity of the ith and jth stimuli, such that 
0<S <1. With two discriminative stim- 
uli, A; and As, reinforcement of A will 
result in an increment in the associative 
strength of A; according to the usual 
equation : 


AVa, = o,0( — Va). (2) 


By generalization from A; to A», however, 
this trial will also result in an increment 
in the associative strength of A», according 
to the equation, 


AVa, = Sa aga (^ — Va). — (9) 


Similarly, nonreinforcement of Ag will de- 
crease the associative strength of A», and 
will also result in a proportional decrease 
in the associative strength of Au. 

A sufficiently high value of Sa,,, will 
ensure that discriminative performance be- 
tween A; and A, will never be perfect, 
just as a sufficiently high value of Six 
(strong generalization between A and its 
absence) will ensure that even at asymp- 
tote the level of conditioning maintained 
by a weak CS will be below that main- 
tained by a more intense CS. With 
suitable performance rules, intermediate 
levels of generalization will not affect per- 
formance at asymptote, but the general- 
ization parameter will still provide an 
additional factor (over and above a) de- 
termining rate of learning. It is possible, 
therefore, to allow that with sufficient 
training the value of « associated with 
any stimulus can approach 1, without 
thereby abolishing differences in rate of 
learning correlated with differences in in- 
tensity or discriminability. 


Specificity of a 


Most formal theories of selective atten- 
tion have followed Zeaman and House 
(1963) and Sutherland (1964) in assuming 
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that subjects learn to attend not just to 
specific stimuli, such as a vertical line, 
black door, or red response key, but to 
stimulus dimensions, such as line orienta- 
tion, brightness, or hue. Sutherland and 
Mackintosh (1971), for example, assumed 
that changes in attention could be repre- 
sented by changes in the strengths of 
various analyzers, each of which detected 
variations along a particular stimulus di- 
mension. The assumption is a rather 
natural one for theories designed to ac- 
count for the data of experiments on 
simultaneous discrimination learning, where 
subjects are usually confronted on each 
trial with a pair of stimuli, such as vertical 
and horizontal lines or black and white 
doors, which can clearly be described as 
differing along such dimensions. As Jen- 
kins and Sainsbury (1969) have noted, 
however, it may not seem quite so natu- 
rally suggested by a successive discrimina- 
tion between the presence and absence of 
a specific stimulus, and it can be applied 
only with some difficulty and little success 
to certain of the data obtained in such 
a situation. 

There are, of course, experimental data 
that seem to provide good evidence for the 
dimensionality of attention. In studies of 
intradimensional and extradimensional 
shifts, the specific stimuli to which the 
subject is exposed change from one stage 
of the experiment to another; the observa- 
tion that learning of the shift problem is 
faster if the relevant stimuli of the first 
problem differed along the same dimension 
as those of the shift certainly suggests that 
the changes in attention resulting from 
original training were changes in attention 
to entire stimulus dimensions, Similarly 
if the phenomenon of transfer along a con- 
tinuum (Lawrence, 1952) is to be inter- 
preted in attentional terms, the implica- 
tion is that an increase in the probability 
of attending to stimuli differing widely 
along a particular dimension results in 
increase in the probability 
another set of stimuli diffe 


along the same dimension. 


an 
of attending to 
ring less widely 


It is questionable, however, whether data 
Such as these require the assumption that 
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analyzers appropriate to entire stimulus 
dimensions are strenghtened and weakened. 
One could equally well assume that an 
increase in attention to one stimulus gen- 
eralized to other stimuli in proportion to 
their similarity to the training stimulus. 
Thus an increase in G4, resulting from 
the correlation of reinforcement with the 
presence of A; and nonreinforcement with 
its absence, might result in graded in- 
creases in a4, aa, . . . æa, Similarly, if 
Ai signaled reinforcement and A, signaled 
nonreinforcement, as in an experiment 
on discrimination learning, generalized 
changes in æ would accrue to the inter- 
vening stimuli A, and A; from both Ai 
and A,. 

This approach has the advantage of 
again narrowing the discrepancy between 
traditional, single-stage theories of asso- 
ciative learning and two-stage theories of 
selective attention. No appeal is made 
to hypothetical observing responses or 
analyzers detecting dimensions of stimulus 
change. A change in the state of one 
stimulus, whether of its 
strength, V, or of its learnin 
eter, o, is said 
stimuli in accord 
tional (even 
There is the 


associative 
g rate param- 
to generalize to other 
ance with entirely tradi- 
if unspecified) assumptions. 
erc further advantage that this 
revision may explain the relatively 
differences often observed 
of intradimensional 


small 
in comparisons 
f and extradimensional 
shifts in animal subjects. If changes in a 
to one pair of stimuli differing along a 
Particular dimension result in only gen- 
eralized changes in æ to another pair, it is 
entirely reasonable to expect these latter 
changes to be rather small, 
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an appropriate instrumental response. Àl- 
though deliberately vague, this is neither 
contentious nor problematic. "Theories of 
attention, however, are faced with the 
necessity of specifying further relations 
between their theoretical terms and their 
subjects’ responses. As noted earlier, in 
traditional two-stage theories the direc- 
tion of the subject's attention is assumed 
to affect not only what the subject learns 
as a consequence of the outcome of the 
trial, but also what response the subject 
makes on that trial. The probability that 
à rat will choose the black rather than 
the white arm of a T maze depends not 
only on the relative associative strengths 
of black and white, but also on the rela- 
tive strengths of attention to brightness 
and other stimuli. In the original one- 
look model of Zeaman and House (1963), 
the subject attends to only one stimulus 
dimension on each trial, and behavior on 
that trial is controlled only by the response 
tendencies to those stimuli. Sutherland 
and Mackintosh (1971) adopted a more 
complex solution, assuming that behavior 
was sometimes controlled by only one 
analyzer, but that sometimes more than 
one set of stimuli would determine 
behavior. 

There is, in fact, definitive evidence 
that in two-choice discriminations behavior 
can be controlled simultaneously by several 
sets of stimuli; rats can, after all, solve 
conditional discriminations. But there is 
equally good evidence pointing to this 
conclusion from studies of simple simul- 
taneous discriminations, where solution 
could in principle be based on only one 
set of stimuli at a time. When more than 
one set of stimuli is simultaneously (but 
redundantly) relevant, it is possible to 
Show that performance may be controlled 
by both on a single trial (Sutherland & 
Mackintosh, 1971, p. 142). Even in the 
case where one set of stimuli is relevant 
and another irrelevant, there is evidence 
that both relevant and irrelevant stimuli 


may combine to control performance. 


This is suggested by the fact that rats, 

trained on a simultaneous visual discrimi- 

nation, having responded consistently to 
on, £ 


one position over a long series of trials, 
may make no errors once they break their 
position habit (Sutherland & Mackintosh, 
1971, p. 91). Analysis of the performance 
of animals shifted from one discrimination 
to a second, in which the originally relevant 
stimuli become irrelevant, has shown that 
few if any errors occur on those trials 
when the new S4- is combined with the 
former, but now irrelevant, S+. Such 
results suggest that both stimuli contribute 
to the subjects’ choice behavior (Medin, 
1973; Tighe, 1973). 

These conclusions are quite consistent 
with the general trend of the present 
argument. It may be assumed, therefore, 
that all stimuli contribute to performance 
in accordance with their current associative 
strengths. The question remains whether 
one should assume that changes in @ not 
only determine the rate of learning about 
specific stimuli but also affect the extent 
to which a stimulus' associative strength 
is actually translated into performance. 
One could certainly assume that the con- 
tribution of any stimulus to performance 
was a product of its associative strength 
and « value. Thus as a4 declined, even 
if V4 had some moderate value, the con- 
tribution of A to performance would also 
decline. 

There is actually very little evidence 
bearing on this question. Since, in gen- 
eral, a stimulus with a low value of a will 
also be one with little associative strength, 
it may be very difficult to decide whether 
the reason why a particular stimulus fails 
to affect behavior is because it is not 
attended to (ie. has a low a value) or 
because its associative strength is low. 
There is evidence from studies of dis- 
crimination learning that differences in the 
associative strength of S+ and S— from 
an initial discrimination are preserved 
while the animal learns another problem 
with the initial stimuli no longer corre- 
lated with reinforcement—even though 
there may be no detectable evidence that 
such stimuli affect behavior (e.g., latency 
of responding) on the intervening problem 
(Mackintosh, 1963; Stettner, 1965). Simi- 
larly, experiments on generalization in 
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pigeons have established that in the 
presence of one stimulus (e.g., a colored 
background on the response key), another 
set of stimuli, such as the orientation of 
a line on the key, may exert little or no 
control over responding, in Spite of the 
fact that a particular line has been estab- 
lished as a signal for reinforcement, and 
reliably sloping gradients may be observed 
if different lines are presented on a black 
background (Newman & Benefield, 1968). 

Such instances of masking of control, 
however, can be explained without recourse 
to the assumption of competition between 
stimuli for control, The associative strength 
of the masking stimuli may be sufficiently 
great that ceiling effects obscure the con- 
trol being exercised by the masked stimuli. 
These results, therefore, cannot really be 
said to provide conclusive evidence of the 
influence of a on control. Until such 
evidence is Provided, it would seem reason- 
able to Suggest that a may simply be a 
learning-rate Parameter, with no effect on 
the control of behavior. Behavior would 
then be determined by a combination of 
the associative Strengths of all stimuli 
Present and impinging on the subject’s 
receptors. 

The only set of results 
a revision of this assumption is the ob- 
servation, noted earlier, that a stimulus 
imperfectly correlated with reinforcement 
may initially acquire associative strength 
but will eventually lose control if it is 
accompanied by other, more valid stimuli. 
The explanation provided by Rescorla and 
Wagner (1972) is that such 
loses associative strength due to competi- 
tion with more valid stimuli for a limited 
total amount of associative Strength. The 
Present theory is better able to explain 
why an invalid stimulus should not ac- 
quire associative Strength in the first Place 
than to explain why, having acquired 
Strength, it should then lose it, [t may 
be necessary to assume that although such 
a stimulus maintains its associative 
Strength, a decline in æ Will decrease the 
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CONCLUSIONS 


The theoretical ideas proposed here may 
be summarized briefly. Changes in the 
associative strength of a stimulus are 
partly determined by a learning-rate pa- 
rameter, æ, specific to that stimulus. This 
parameter is itself subject to change, in- 
creasing when a stimulus predicts a change 
in reinforcement, decreasing when it does 
not. This idea is formally equivalent to 
one of the main tenets of two-stage, at- 
tentional theories of learning, namely, the 
assumption that the probability of at- 
tending to a stimulus determines the 
probability of learning about that stimulus 
and may itself change with experience: 
This equivalence may justify character- 
izing the present set of ideas as a theory 
of attention, but since that term has a 
number of connotations, it might be better 
to stress that what I am proposing is a 
theory about the associability of stimuli 

ith reinforcement, At the risk of be- 
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ideas proposed here are more a program 
for a theory than a fully elaborated formal 
model of conditioning and discrimination 
learning. In our present state of knowl- 
edge, there is little need to apologize for 
this. It is more important to point to 
some factors that may limit the generality 
of the present arguments. 

First, I have assumed that it is possible 
to predict the behavior of a subject toward 
a compound stimulus, AB, solely from 
knowledge of the status of its elements, 
A and B. The possibility of “configural” 
conditioning or “compounding” shows that 
this is not always true: With sufficient 
training, animals can learn to respond ap- 
propriately when AB is consistently rein- 
forced but A and B separately are con- 
sistently not reinforced (e.g., Rescorla, 
1972a; Woodbury, 1943). There is good 
reason to believe, moreover, that configural 
learning occurs even when it is not ex- 
plicitly reinforced. Booth and Hammond 
(1971), for example, showed that con- 
tinued reinforcement of AB might be suf- 
ficient to reduce significantly the level of 
responding maintained by A and B alone, 
even without nonreinforcement of the com- 
ponent stimuli in isolation. 

Similar configural learning is a fairly 
common observation in operant discrimi- 
nation learning by pigeons. If pigeons are 
rewarded for pecking an illuminated re- 
sponse key in the presence of a 1,000 Hz 
tone but not in its absence, they will learn 
not to respond to the key light alone but 
will equally refuse to peck in the presence 
of the tone if the key light is turned off 
(Jenkins & Harrison, 1960). This case is 
of particular interest in the present con- 
text, for it is an example of the main 
unresolved problem for the analysis pro- 
posed here—how a salient but relatively 
invalid stimulus comes to lose control over 
responding after it has initially acquired 
such control. If pigeons stop pecking 
when the key light is turned off, it is 
obviously misleading to suggest that the 
light has lost control over responding. 
Perhaps similar configural effects occur in 
other, similar situations. There are, in 
fact, very few data available to show how 


far the apparent absence of control by B 
following reinforcement of AB reflects con- 
figural conditioning to the AB compound. 

A second factor ignored by the present 
analysis may also contribute to at least 
some cases where salient irrelevant stimuli 
lose control of responding. From the time 
of their earliest discovery, results showing 
apparently selective effects in conditioning 
and discrimination learning have been at- 
tributed by some investigators to the oc- 
currence of overt orienting responses. If B 
fails to acquire control over responding 
following reinforcement of AB, it is said 
to be because the subject oriented towards 
A and thus failed adequately to observe B. 
One of the few instances where there is 
actually good reason to suppose that such 
effects might operate is in the experiments 
of Jenkins and Sainsbury (1969), in which 
pigeons were trained to peck at visual 
displays containing discrete elements. In 
this situation, A and B would refer, for 
example, to small red and green dots or 
to a circle and a star; these elements were 
displayed on a large, specially constructed, 
response key, which enabled the experi- 
menters to record which element was 
pecked at on a given trial. If birds were 
trained on a successive discrimination be- 
tween AB+ and B—, Jenkins and Sains- 
bury found that they learned the dis- 
crimination by concentrating their pecks 
at A on reinforced trials. This suggested 
the possibility that when pecking at 
(orienting towards) A on reinforced trials, 
the pigeon associated only A with rein- 
forcement. Although nominally reinforced 
on 50% of trials, therefore, B lost control 
of responding because it failed to gain 
strength on reinforced trials. Although 
later finding that pigeons could, to some 
extent, associate B with the outcome of 
a trial even though pecking at A, Jenkins 
(1973) also obtained results that suggest 
that the (partial) selectivity of orienting 
responses in this type of situation may 
indeed play an important role in enabling 
birds to learn the discrimination between 
AB+ and B—. If in such a problem 
reinforcement was forfeited on positive 
trials whenever the birds pecked at A, 
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they continued to peck at B on both 
positive and negative trials and thus failed 
to learn the discrimination. Because pecks 
were directed at B, it gained strength on 
reinforced trials and thus continued to 
control responding. It is possible, there- 
fore, that in this situation at least, the 
loss of control by the relatively invalid 
stimulus, B, depends on a mechanism of 
selective orientation that lies outside the 
scope of the present analysis. 


Finally, it is necessary to acknowledge 
the possibility that there may be situations 
in which selective effects occur for the 
reasons postulated by traditional theories 
of selective attention. Mackintosh (1971), 
for example, found that significant over- 
shadowing of a weak tone by a strong 
light might occur on the very first trial of 
conditioning, Such overshadowing clearly 
cannot depend on subjects learning the 
redundancy of the tone nor on the mecha- 
nism proposed by Rescorla and Wagner 
(1972); it Suggests a perceptual or atten- 
tional interaction. It is possible that 
similar instances of attentional interactions 
would be observed more frequently if they 
were sought under more appropriate con- 
ditions. I earlier argued that the inverse 
hypothesis of theories of selective atten- 
tion did not rest on any very secure 
rationale: Animals may have a limited 
capacity for processing information, but 
it is hard to believe that 
prevents the simultaneous 
relatively salient stimuli typically used in 
studies of conditioning, If there is a limit, 
however, its effects should be apparent 
under suitable conditions, such as the brief 
controlled presentations of complex stimuli, 

This argument has also been advanced 
by Riley and Leith (in press), and its 
force is Suggested by the results of an 
experiment, undertaken by Turner (Note 


2), on delayed matching to Sample in 
Pigeons. Given un 
sample key, 


this limitation 
analysis of the 


they were going to be required 
the color or the line displ 
sample, 


to match 
ayed on the 


Under these conditions, therefore 


Estes, W. K. 


they were clearly able to attend to both 
features of the sample. But as soon as 
exposure to the sample was reduced to a 
fixed, very brief period of time, perfor- 
mance deteriorated sharply, unless the 
subjects were given an additional, condi- 
tional cue that Signaled whether the en- 
suing trial was going to require a line 
match or a color match. The implication 
is that pigeons were unable to analyze 
(or remember) both features of the com- 
pound sample when it was presented for 
only a brief interval. Provided the condi- 
tional cue signified which feature they 
needed to attend to, however, they main- 
tained accurate performance. There was 
evidence, therefore, that stimuli were com- 
peting for access to a limited capacity 
system, and the inverse hypothesis was fully 
supported. 

Although it may seem 
complex, it is possible that a complete 
analysis of conditioning and discrimination 
learning will require the assumption that 
the degree of competition between stimuli 
may vary from one extreme, where all 
available stimuli are analyzed on a single 
trial, to the other, where something like 
à one-look model may apply, 
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An Emotion-Attribution Approach to Moral Behavior: 


Interfacing Cognitive and Avoidance Theories 
of Moral Development 


Richard A. Dienstbier, Donald Hillman, John Lehnhoff, 
Judith Hillman, and Maureen C. Valkenaar 
University of Nebraska—Lincoln 


A theory is presented concerning the impact of attributions about the 
causes of emotional responses as they influence self-control in temptation 
situations. Research is reviewed indicating a high level of adult sensitivity 
to external influence in making such causal attributions. Two studies are 
presented in which the posttransgression emotions of second-grade children 
are labeled shame (because of being found out) or guilt (due to the trans- 
gression itself); when a similar situation was subsequently represented as 
safe from detection, shame-condition children transgressed 60-80% more 
than guilt-condition subjects. It is suggested that emotional arousal elicited 
in temptation situations because of past punishment or options that are in- 
consistent with the self-image is necessary for inhibition but not sufficient 
unless attributed to a relevant cause. The literature on the relative effec- 
tiveness of moral socialization techniques is discussed with respect to the 
theory, and the relevance to cognitive dissonance and to overjustification 
approaches to motivation is discussed. An integration of social-learning and 
cognitive-developmental theories is approached through explicating the 
translation of moral decision into behavior by focusing on the ways that 
cognition may exert partial control over the impact of less finely differen- 
tiated emotional response, allowing cognitive overrides of contradictory emo- 
tional dispositions without eliminating the emotion. 


Most of the modern literature on the de- 
velopment of conscience or morality is re- 
lated to one of two major approaches—cog- 
nitive, concerned with moral reasoning and 
decision processes, or social learning, con- 
cerned with the emotional states and be- 
haviors associated with self-control in the 
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face of temptation. Although both ap- 
proaches are supported by the data of a 
great many studies, the two approaches re- 
main essentially unintegrated. We seek to 
explicate some of the important steps in the 
translation of moral decision into moral be- 
havior and to apply that analysis to an ex- 
planation of why certain moral socialization 
techniques are more conducive to inter- 
nalization than othters. Our theoretical per- 
spective is presented in the early sections of 
the article; two supporting studies follow. 
The emphasis of the final section is on the 
relevance of the theory to traditional ap- 
proaches to morality and to the socialization 
literature. 

The theory can be stated in outline as fol- 
lows: Behavior associated with self-control 
in the context of temptation is heavily influ- 
enced by negative emotional states usually 
characterized as anxiety, fear, guilt, and/or 
shame. This emotional mediation of moral 
behavior continues into adulthood, when the 
resultant behavior may relate to higher 
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orders of moral judgment as defined by 
modern cognitive approaches to morality. 
Although the association of these negative 
emotional states with decisional and be- 
havioral processes may take place in part 
through mechanisms of conditioning and 
learning specified by traditional avoidance 
learning theorists (e.g, Mowrer, 1950), 
these associations depend heavily on the 
causal attributions that are made about the 
source of the negative emotions during so- 
cialization experiences. Social and situa- 
tional influences (often temporary) play an 
important role in forming causal attribu- 
tions; different physical and verbal socializa- 
tion techniques particularly provide different 
information relevant to the child's causal at- 
tributions. Later, in the face of temptation, 
the impact of the emotion response on be- 
havior will be heavily influenced by the still 
malleable beliefs held about the causal 
origins of the emotional response. When 
the individual believes detection of trans- 
gression is not possible, a negative emotional 
response in the face of temptation is neces- 
sary but not sufficient for the inhibition of 
transgression. In order for that emotional 
arousal to serve an inhibitory function, the 
individual must identify his emotional dis- 
comfort as due to a relevant cause, such as 
the transgression per se (and the implica- 
tions of the transgression for selt-image, 
etc.), rather than as due to an irrelevant 
cause, such as the fear of punishment, 


: 1 With 
maturity, control of emotional attributions 
passes from the socializing agent to the indi- 


vidual and becomes related to the level of 
moral development, although situational 
social cues continue to pl 
role. 


and 
ay an important 


EMOTION ATTRIBUTION 


The work of Schachter and Singer (1962) 
demonstrated that when subjects injected 
with adrenalin did not anticipate that arousal 
effects would result, they tended to attribute 
their emotional state to the Situation, feel- 
ing happier or angrier depending on the 
specific postinjection experimental context, 
That study was the first in a series by 
Schachter, his students, and others that in- 


dicated that people are quite flexible in the 
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manner in which they make causal inferences 
about the source and meaning of their emo- 
tional arousal. . 

Subsequent research exploring the impli- 
cations of those findings for moral behavior 
followed the work of Lykken (1957). Lyk- 
ken had shown that psychopathic individuals 
seemed not to experience normal emotion- 
induced inhibitions. Schachter and Latané 
(1964) demonstrated that psychopathic 
criminals (compared to normal criminals) 
would learn to avoid shocked errors (com- 
pared to nonshocked errors) in a lever-maze 
task only after sustaining arousal from an 
adrenalin injection. Schachter and Ono 
(cited in Schachter & Latané, 1964) found 
increased cheating by college students who 
had received the tranquilizer chlorpromazine 
(compared to placebo control groups). 
These studies can be viewed as evidence of 
the necessity of emotional arousal as an im- 
portant component in the avoidance of re- 
sponses with aversive consequences, with an 
increase in arousal facilitating that avoidance 
and a decrease attenuating avoidance. How- 
ever, since only the degree of arousal was 
manipulated, and not interferences the sub- 
ject might make concerning causal attribu- 
tion, the necessity of arousal was demon- 


Strated but the issue of the sufficiency of 
arousal was not addressed 


By illustrating the role of emotional 


arousal, these data provided support for 
Mowrer’s (1950) two-factor avoidance 
model. — Mowrer’ 


: 5 model emphasized that 
the emotional response that the organism 
developed 1n response to the cues signaling 


an aversive event would motivate the avoid- 
ance 


Tesponse, which, when successful, 
: lead to the reinforcing reduction of 
the negative emotional state? ^ Emotional 


arousal in anticipation of punishment for a 
potential transgression should, by this inter- 
pretation, lead to the avoidance of the temp- 
tation behavior. But Schachter and Latané 
(1964) spectulated that for the psychopath, 
the avoidance of behavior with aversive con- 


———— 


* More modern fheorists have extended this 
Werspective, suggesting that anxiety becomes con- 
ditioned specifically to the “behavioral and cog- 
i dia of the act” (Aronfreed, 1968, pp. 


EworION-ATTRIBUTION AND MORAL BEHAVIOR 301 


sequences depended on more than the mere 
presence of arousal. Psychopaths experi- 
ence a high level of physiological arousal 
with no apparent subjective emotional dis- 
tress in response to cues that would evoke 
an emotional response in normal individuals. 
Schachter and Latané suggested that it was 
the chronic inability of the psychopath to ex- 
perience his physiological arousal as emotion 
that accounted for his moral insensitivity.* 
These suggestions approach the perspective 
proposed in this article, but our view differs 
by suggesting that even in temptation situa- 
tions where arousal is experienced as emo- 
tion, specific causal attributions are crucial 
in determining the behavioral outcome. 

Tt was the Schachter and Latané (1964) 
research that led to a series of studies by the 
first author indicating that college-age men 
and women more often succumbed to the 
temptation to cheat when they could attrib- 
ute the emotional arousal associated with 
cheating to a placebo pill, which suppos- 
edly had associated emotional side effects 
(Dienstbier, 1972; Dienstbier & Munter, 
1971). Whereas Schachter and Singer 
(1962) had demonstrated that external cues 
are important in allowing the attribution of 
and experiencing of artificially induced 
arousal as specific emotion, the cheating re- 
search demonstrated the reverse—that nat- 
urally induced specific emotions could be in- 
terpreted as nonspecific arousal when sub- 
jects attributed the associated arousal to a 
nonemotionally relevant source such as the 
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2 Suggestions by Mandler (1964) and the theo- 
retical structure proposed by Lazarus (1908) are 
ul in understanding possible mechanisms to 
account for the psychopath's lack of emotional ex- 
ence in the face of arousal. Mandler suggested 
od of the typical psychopath is 
frequent emotional upheavals and 
1 might learn to minimize the 
ese frequent traumas by 
signals from his body, 
ribute to a strong emo- 
accentuate his 
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that the individua 
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Ene suggestions fit well with Lazarus's 
(1968) concept of benign reappraisal, the process 
by which the stress reaction is reduced by a € 
tive reappraisal involving a lowered a e. i z 
the potential danger m a situation after the ma 
idus s adjudged that he is powerless to affect 
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vidual ha 
the outcome. 


placebo pill. Since these cheating studies 
are important for the theoretical perspective 
developed in this article, they are reviewed 
here in some detail. 

Recruited as participants in a study on the 
effects of a vitamin supplement on vision, 
subjects were given a vocabulary test as a 
"delay task while waiting for the pill to take 
effect." Besides anticipating visual effects 
irom the pill, a random half of the subjects 
were led to anticipate arousal effects of “a 
pounding heart, hand tremor, sweaty palms, 
a warm or flushed face, and a tight or sink- 
ing feeling in the stomach" (following Nis- 
bett & Schachter, 1966). Control subjects 
anticipated benign side effects irrelevant to 
arousal. After the vocabulary test, to vali- 
date the potency of the pill, subjects viewed 
the auto-kinetic illusion, and were led to un- 
derstand that the pill was responsible for 
their seeing the immobile light as moving. 
"Thereafter, subjects were given the opportu- 
nity to check over their vocabulary test an- 
swers and to see the correct answers. They 
were informed that the department develop- 
ing the vocabulary test was interested in 
using it for future prediction of college suc- 
cess, and that the "board of psychologists" 
responsible for it would like to interview any 
students who did particularly poorly on the 
test (less than 20 right). The test was de- 
signed so that less than 166 of college fresh- 
men could attain that score without cheating. 

The period during which the individual 
could look over the correct answers to the 
vocabulary test and his own answers was 
also introduced as a "delay period," in this 
case prior to the rating of side effects because 
"it was important that the subjects rate the 
side effects they might be experiencing just 
9 minutes after they had experienced the 
visual effects." The salience of the side ef- 
fects expectation was increased for that pe- 
riod by having subjects look over a side ef- 
fects checklist at the onset of that period. 
Any cheating that took place during the de- 
lay period could be detected because pres- 
sure-sensitive paper (making a second copy 
of the answers) was placed under the answer 
paper during the taking of the test: the 
answer paper was removed by the subjects 
prior to the cheating period, so that there- 
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after any changes made on the real answer 
paper would not correspond to those on the 
pressure-sensitive paper. 

In the first study (Dienstbier & Munter, 
1971) 49% of the subjects who anticipated 
arousal side effects from their placebo pills 
cheated, compared to 2795 of the control 
condition, with cheating defined as chang- 
ing any answers. This significant. differ- 
ence became more impressive when it was 
noted that the entire effect was due to the 
male subjects, whose cheating rates were 
56% and 17% in the two conditions, respec- 
tively. Follow-up research (Dienstbier, 
1972) indicated that women, too, respond to 
the arousal placebo with increased cheating 
if the threat of the consequences for failure 
is made less frightening. (it appeared that 
in the original study the more intense an- 
xiety experienced by women in response to 
the threat of being called by the “board of 
psychologists” interfered with their attend- 
ing to the pill side effects). In a second 
study reported in the same article, it was 
further determined that the effect of the 
arousal placebo in reducing resistance to 
temptation was indeed due to the subject's 
attributing specifically to the pill—merely 
attending to arousal feelings without having 
taken the placebo did not produce the effect, 

It appeared that all subjects experienced 
emotional arousal while confronting the 
temptation to cheat, largely as a result of 
considering the act of cheating and the po- 
tential negative consequences. But when 
subjects could attribute the cause of their 
arousal to the pill, that ar 


Ousal ceased to 
have an inhibiting effect. Tt is not, in other 


words, that the subjects in either pill condi- 
tion experienced more or less arousa], but 
rather that the attribution of the arousal to 
a neutral source (in the arousal pill condi- 
tion) defused that arousal, Preventing it 
from having emotional relevance, 
a 

5 There are two lines of evidence Suggesting that 
subjects make the attributions of emotiona] causal. 
ity at a less than verbally conscious level, First 
when subjects were interviewed, none ever indi. 
cated conscious awareness of such a decisional 
process, (Subjects were completely debriefeq and 
questioned concerning possible suspi F 


: : Ciousness foj. 
lowing the cheating research.) econd, at the Ea: 
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Whereas Schachter and Latané (1964) 
had speculated that this process in chronic 
form accounted for the moral insensitivity of 
the psychopath, the cheating studies sug- 
gested that the same process could be dem- 
onstrated to produce episodes of moral in- 
sensitivity in normal subjects. 

The work of Solomon and W ynne (1954) 
and subsequent researchers emphasized that 
although emotional arousal was necessary in 
the acquisition of avoidance responses, the 
elimination of overt signs of arousal was 
often observed once such responses had been 
well learned. Except for the Schachter and 
Ono data (cited in Schachter & Latané, 
1964), which demonstrated increased cheat- 
ing with chloropromazine, we would not 
have anticipated, therefore, that emotional 
arousal played any role in moral decision 
making for normal adults, Yet the cheat- 
ing results suggested that even for 19-year- 
olds in temptation situations, the avoidance 
of transgression is still heavily dependent on 
emotional arousal, Additionally, the studies 
demonstrated that although 19-year-old 
adults should have had considerable experi- 
ence in learning which features of their own 
behavior and of the environment lead to 
negative emotional arousal they were sur- 
Prisingly susceptible to external influences 


in attributing arousal experienced in the face 
of temptation, 


But however powerful these data are, they 


do not provide the theoretical bridge to our 
final point, Our thesis requires that we 
demonstrate more than the defusing of emo- 
tional arousal through attribution to a non- 
emotional source in a context as artificial as 
that associated with the placebo pill studies. 
Ve must demonstrate instead that emotional 


of the cheating period, when ar 
the degree to which the 


formation concerning e 


Xpected symptoms is given 
thereafter the fica 


a? 
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arousal may be defused for moral behavior 
even though still experienced as negative 
emotion, through the attribution to an emo- 
tion-inducing source that has no direct rele- 
vance to the moral decision. 


SocIALIZATION AND EMOTION ATTRIBUTION 


We speculated that children would be 
even more receptive than adults had proven 
to be to cues that would suggest different 
causal attributions for emotional arousal, and 
that these cues would play an important role 
in determining the impact of socialization 
processes. Specifically, if a socialization pat- 
tern left the child with the impression that 
the cause of his emotional arousal or dis- 
comfort was his own misbehavior, the impact 
of this internal orientation on the child’s be- 
havior would be very different from the im- 
pact of an external orientation in which 
equally intense emotion was thought to be 
caused by fear of punishment. The latter at- 
tribution would be particularly ineffective in 
situations in which the tempted individual 
could be certain that detection was impos- 
sible. 

Even early in their socialization history, 
most normal children will experience some 
emotional arousal following transgression 
and detection by a disapproving adult. Sev- 
eral features of the situation naturally lead 
to that arousal: the child's knowledge that 
the adult disapproves, previous scoldings 
now recalled, anxiety over the uncertainty of 
the outcome, and the likelihood that the adult 
will emphasize the discrepancy between the 
child's behavior and higher standards. The 
manner in which the socializing agent re- 
sponds to the child’s transgression will influ- 
ence the causal attributions. the child will 
make about his emotional discomfort. So- 
cialization procedures that draw the child's 
attention to the transgression rather than to 
the aftermath (confrontation and punish- 
ment) facilitate the attribution of arousal to 
the act of transgression. — Generally, such 

cedures involve discussions of the trans- 
Pee in a manner that does not threaten 
gu so that the child is able to dwell 
the ee ion itself and the behavioral 


ssi 

he transgression | 

oii he has violated, rather than on 
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the threat that confrontation offers. Often 
the self-concept is manipulated (e.g., “you 
are a bad boy”) in relation to the standard 
or behavior (“because you didn’t listen and 
lied”). The attribution of arousal to the 
anticipated transgression later leads the child 
to avoid the transgression itself. On the 
other hand, responses by socializing agents 
that threaten the child or that become salient 
through pain draw sufficient attention away 
from the transgression so that the child is 
likely to attribute his emotional discomfort 
exclusively to confrontation with the social- 
izing agent. When later facing a similar 
temptation situation, a child treated in the 
latter manner might experience a high level 
of emotional arousal, but by attributing his 
arousal to fear of detection and punishment 
he would tend to resist temptation only if 
he believed that detection was likely. 

In our research with children we wished 
to buttress our theory by establishing a temp- 
tation and transgression situation in which 
the emotional arousal following detection 
would be equal for the two groups of chil- 
dren, with the children assured that future 
transgressions could not be detected. We 
then imparted to the children an internal 
(due to own behavior) or an external (due 
to confrontation) causal attribution concern- 
ing the source of that emotional arousal. It 
was hypothesized that the children who be- 
lieved that they “felt bad" for their previous 
transgression (internal orientation, or guilt) 
would misbehave far less than children in 
the same situation who believed that they 
"felt bad" because their previous transgres- 
sion had been detected (external orientation, 
or shame). 


Srupy 1 


Method 


In Study 1 of the two studies with children, scc- 
ond-grade subjects were brought, one at a time, 
to a “new toy room” in their school and were 
asked to keep watching a slot car go slowly around 
an oval track during the experimenter’s absence; 
they were to stop the car with a control device if 
the car began to go too fast. Although the child 
was given sufficient justification for watching the 
car, other electrical toys "with loose wires" con- 
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trolled by a hidden observer * were started and 
stopped according to a schedule to distract the 
child. If the child transgressed by allowing him- 
self to be distracted from the implied duty of 
watching the slot car, the observer caused the slot 
car to jump the track (the event that was sup- 
posed to be avoided) and simultaneously signaled 
the experimenter to return. 

Pretending that the slot car was broken, the ex- 
perimenter asked the child to accompany him toa 
separate room, where the independent manipulation 
was administered: labeling the child’s feelings as 
due to transgression (guilt) or due to being found 
out following the transgression (shame). The 
child was then returned to the room, taught to 
operate the slot car correctly, and assured that 
this “new” car would not break if it jumped the 
track; he was also assured that the door would 
now be locked and the experimenter did not have 


a key, implying that detection of transgression of 


iling to watch the slot car would be impossible. 
he dependent measure was the total time spent 
not attending to the slot car during this second 
watching phase of 12 minutes (with the experi- 
menter absent). A reliability of .99 was obtained 


between two observers for five subjects; thereafter 
one observer was used. 

One male and one female experimenter partici- 
pated,® each using boys and girls as subjects, so 


that with the independent manipulation of guilt 


versus shame, a 2X2X2 factorial design was 
effected, 


Results 


Results indicated no significant effects for 


either the sex of experimenter or children, 
but the nonsignificant difference between the 
mean seconds transgressing for the shame 
condition (164) and the guilt condition 
(102) were Suggestive. A further 
indicated that most of the demonstr 
creased effectiveness for the guilt m 
tion was due to the female experin 
187 seconds transgressing in the sh. 
dition, compared to 80 seconds in the guilt 
condition, #(22) = 1.91, b < O7, two tailed, 
It was our impression that a real difference 
did exist, despite the lack of a Significant 
main effect for the shame versus guilt condi. 
tions or an experimenter by condition inter- 
action. It was suspected tl 


iat the differences 
between experimenters were due to their dif- 
ferent experience levels with children (the 


more effective one was an elementary school 


analysis 
ated in- 
anipula- 
nenter— 
ame con- 


* A role filled by the second and fifth 


; authors, 
* A role filled by the third and fourth 


authors, 
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teacher and a parent) rather than a true sex 
difference. 

The inconclusiveness of the findings of 
Study 1 led to the changes incorporated into 
the second study, which will be presented in 
somewhat greater detail. 


Strupy 2 


In an effort to reduce the variations in 
background between children in the two con- 
ditions and to allow a matched-pair type of 
analysis, second-grade twins were recruited 
by seeking the cooperation of their parents 
and of the twins themselves. Except for 
some minor changes in procedure and script 
and the participation of only one experi- 
menter (the first author, who fit the cri- 
terion of having sufficient experience with 
children), the design of the study was essen- 
tially the same as for Study Í. Children 
were not considered separately by sex, since 


the results of Study 1 suggested no sex 
differences, 


Method 


The first twin to Participate w 
domly and brought to the 
other twin played quietly with an assistant in the 
waiting room. The participating twin was seated 
in front of the booth and was told that the slot 
car presently on the track was very old so it might 
break if it did fall off the track, but that the prob- 
lem with the cars sometimes going too fast was in 
the track. The experimenter's first absence was so 
that she could “go to call the man who fixes the 
toys." The Scheduled use of distraction toys such 
as the electric train “with the loose wires” was 
AT dB n the first phase of Study 1. 
Although time before the slot car 

and the booth light would go 
the fi was 6 continuous seconds of 
distraction, if the child had not met that criterion 
by the end of the 6th minute, the criterion time was 
nd for each passing minute of 


Ma PES Child. Since the members of 
twin pair tended to act remarkably alike, their 
articipation times tended to be very 


as chosen ran- 
toy room while the 


the same as i 


Immediately following 

the experimenter returned, 
the transgression and the 
Experimenter asked the chil 
the adjoining room, where 
experimental condition 
Propriate manipul 
Script (the secti 
additions that ch 
shame script, or 


the child’s transgression, 

Upon “discovering” 
"broken" slot car, the 
d to accompany him to 
he identified the twin's 
d and administered the ap- 
ation according to the following 
Ons in parentheses indicate the 
ange the basic guilt script to the 
where two sets of parentheses oc- 


x 


—— 
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cur together, the first is the shame section and the 
second the guilt section : 


I bet you feel a little bad now that (I found out) 
the car fell off. I've seen other kids feel bad 
when (someone found out) they weren't able to 
do exactly what they were supposed to do. 
When the other kids who tried to watch the car 
couldn't, they felt bad (when I found out) too. 

But its important that we do stop the car 
when the light first comes on, and before it flies 
of the track, so we'll try again with this new car. 
You won't feel bad if (I find this car still) (this 
car stays) on the track—if (I find) you've done 
a good job, you'll feel good, won't you? 

Have you noticed that when other kids (can 
show people that they) have done the right thing, 
that they feel good? And when (people find out) 
they did things they were not supposed to, they 
feel bad, don’t they? (When someone finds out) 
(Even if no one ever finds out) they feel bad 
for not doing what they were supposed to do, 


don’t they? 


After returning to the toy room, the child was 
directly informed that since he now was being 
taught to start and control the new unbreakable 
car, and since the door would be locked and the 
experimenter had no key, that “even if it did fly 
off the track I'd never find out, since you could put 
it back by yourself and I can't come right in with 
the door locked." Most children spontaneously 
acknowledged that this was true. With that state- 
ment, the experimenter signaled the observer ê to 
put his hand over his ears while the final part of 
the independent manipulation was administered : 
"Remember, you'd feel bad if (I ever found out 
you did a bad job) (you did a bad job, even if I 
never found out). There is no way that I could 
find out this time, though." The experimenter then 
left the room, locking the door and very loudly 
checking it to make sure it was adequately locked. 

A 12-minute period, spiced with the same dis- 
tracting toys starting up and the same dependent 
measure (the time spent not attending to the slot 
cars) was identical to the second phase of Study 1. 


Results and Conclusions 


Of the nine pairs of twins who came to the 
laboratory, complete data were gathered for 


six pairs. Two pairs were lost because one 


of the twins failed to follow instructions by 
hat he would not 


shutting off the slot car 50 t 

have to watch it. One pair was not run be- 
cause the twins were hyperactive and inat- 
tentive. For each of the six pairs run, the 
twin receiving the shame instructions trans- 
gressed almost twice as much as the twin in 


rg ee 
6 Thanks are due Steve Slane for his faithful and 
competent filling of the observer role. 


E 


—— -. Fraternal Twins 


— litical Twas 


400 


SECONDS TRANSORESSING 


m 


CONDITION 


essing for the members 


Ficure 1. Seconds transgr ne 
ording to condition. 


of each pair of twins acc 


the guilt condition. The average time trans- 
gressing was 322 seconds for the shame con- 
dition, and 177 seconds for the guilt condi- 
tion £(5) = 80, p < .001, two tailed. The 
regularity of the effect across pairs of twins 
is illustrated by Figure 1. 

It is apparent that 8-year-old children are 
sufficiently cognitively sophisticated so that 
the differences between the guilt and shame 
manipulations are perceived, even in the face 
of the anxiety elicited by the confrontation 
with a mildly disapproving and strange 
adult. Additionally, the findings of 60% 
(Study 1) and 8076 (Study 2) increases in 
transgression time for shame subjects over 
guilt subjects support conclusions from the 
adult cheating studies—the attribution of 
emotional causality has a large impact on the 
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behavior of children in resistance-to-tempta- 
tion situations. 

The degree to which the two child studies 
are acceptable as powerful support of the 
theoretical development is dependent on sev- 
eral assumptions that cannot be empirically 
demonstrated. Although we endeavored to 
maintain equal levels of emotional arousal 
for the children in the two conditions, and 
although there are no apparent reasons why 
the shame or guilt script variations might 
have induced different arousal levels, no di- 
rect assessment of emotional arousal was 
made. Also, we cannot be certain that the 
greater effectiveness of the guilt script was 
not due to one of the scripts drawing the 
subject’s attention to considerations irrele- 
vant to the attribution of emotional arousal. 
It might have appeared, for example, that 
the experimenter was less interested in the 
behavior of the children in the shame con- 
dition. Although the similarity of the scripts 
was meant to preclude this possibility, it is 
impossible to be certain that the script dif- 
ferences did not trigger different reactions 
between the children in the two conditions, 
However, since neither of these two ex- 
planations could easily be used to explain 
the results of the Cheating studies with 
adults, the hypothesis involving the causal 
attributions of emotional arousal seems the 
only likely explanation common to both re- 
search approaches. Although the following 
discussion assumes that the child studies 
provide strong empirical support for the 
theoretical developments, we believe that it 
is possible to follow and accept the theoreti- 
cal aspects of our thesis without being totally 
convinced of the relevance of the data from 
the child studies, 

One possible approach to explaining the 
high dependence of our child and adult sub- 
jects on external cues rather than past ex. 
perience in making causal attributions of 
emotional arousal is to suggest that this flex. 
ibility is greatest when the emotiona] state 
in question is an uncomfortable one, 
support-by-default for this view comes 
the emotion-attribution literature ; 
pact of causal attribution manipulations has 
been convincingly demonstrated on 


t ly with 
the negative emotions of anxiety, fear, and 


Some 
from 
the im- 


startle response (Dienstbier, 1972; Dienst- 
bier & Munter, 1971 ; Nisbett & Schachter, 
1966; Ross, Rodin, & Zimbardo, 1969; 
Storms & Nisbett, 1970; Zanna & Cooper, 
1974). In the first author’s research, sev- 
eral failures to observe the causal reattribu- 
tion of arousal associated with emotions not 
related to fear reinforced this view. Fur- 
ther support is provided by the research of 
Lazarus and his associates (Lazarus, 1968), 
who have demonstrated a readiness in their 
subjects to use cognitive information that al- 
lows the reduction of stress. 

Another approach to understanding the 
apparent ease with which the children ac- 
cepted the attribution of arousal to their 
own behavior or internal causes rather than 
to detection or external causes is to focus on 
the readiness of children in this age category 
to accept such internal attributions, Wol- 
man, Lewis, and King (1971) demonstrated 
that as children pass from “young” (ages 
5-7) through “middle” (ages 8-9) to “old” 
(ages 10-13) categories, they report that the 
conditions that stimulate emotions more fre- 
quently come from within themselves, Ka- 
gan (1971) has summarized a wide range of 
literature suggesting that the child’s capacity 
for guilt increases most rapidly between the 
ages of 4 and 10, 


RELEVANCE TO Mayor THEORETICAL 


APPROACHES 
Social Learning Approaches 


Influenced heavily by traditional learning 
theory on the one hand, and by the psycho- 
analytic Concept of identification on the other, 
the social learning approach to morality has 
Progressed in the direction of à 
of those features of the model t 
identification and those features of discipline 
techniques that facilitate resistance to temp- 
tation and appropriate affect, 

Since the impact of modeling and identifi- 
cation research appears to be waning, we will 
not attempt an integration of our view with 
that branch of the Social learning literature.” 


the analyses 
hat facilitate 


typified by the work 
his associates (Bandura, 1968) and 

s, Maccoby, & Levin, 
1957; Sears, Rau, & Alpert, 1 y, à 


P 965), has never freed 
Tom the issues of identification versus simple 


al 
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Instead, our discussion will be concerned 
with socialization techniques and the impact 
of those techniques on the affective and cog- 
nitive responses that facilitate resistance to 
temptation. Several issues in the social 
learning and the socialization-of-morality 
literature will be viewed from our theoreti- 
cal perspective. 

Punishment. As a behavior modification 
technique, and as demonstrated in many 
laboratory studies, punishment (including 
physical) is often a viable control technique. 
That punishment is effective in behavioral 
control (so that its various parameters bear 
analysis) is seldom questioned, and those 
laboratory studies that have compared pun- 
ished and nonpunished control groups have 
generally demonstrated the superiority of 
punishment in behavioral suppression (see 
Johnston, 1972, for a recent review). But 
the socialization literature based on natural- 
istic observation in home and school settings 
indicates that the use of physical punishment 
is negatively associated with the internaliza- 
tion of social control ( Bacon, Child, & Barry, 
1963; MacKinnon, 1938; Maurer, 1974; 
Nowicki & Segal, 1974; Sears, Maccoby, & 
Levin, 1957; Sears, Rau, & Albert, 1965). 

The dilemma of physical or power-ori- 
ented punishment being successful in the 
laboratory and counterproductive in less 
controlled settings, particularly as compared 
to induction, relates to our approach accord- 
ing to the following analysis. When all 
other things are equal, the more severe the 
punishment, the more the emotional arousal 
and the more the resistance to deviation in 
the future, wnless the more intense punish- 
ment causes different attributions about the 
source of the emotional discomfort associated 
with that situation or distracts from other 
administered messages relevant to emotional 


imitation, and in the 1970s seems to have lost its 
appeal as a research area. The failure of many ma- 
jor studies in this area to find convincing evidence 
of causal links between identification and resistance 
to temptations (ie, Sears ct al, 1965) has led 
the author of one significant recent review to sug- 
gest that the evidence linking morality and identi- 
fication was meager at best (Hoffman, 1970). 
While the disinhibiting effects of a transgressing 
model seemed established, the inhibiting effects of 


a self-denying model were not. 


attribution. The first part of that statement 
is an affirmation of classical avoidance the- 
ory—that greater arousal will lead to more 
resistance to deviation, But, as is also sug- 
gested, intense physical punishment should 
indeed cause different attributions of arousal 
from either mild punishment or other dis- 
ciplinary techniques that do not direct atten- 
tion away from the transgression. If the 
physically punished individual faces future 
temptation in situations in which detection 
seems unlikely, then we would predict a par- 
adoxical effect, with more intense punish- 
ment or more physically oriented punishment 
leading to less resistance to temptation, 

Punishment that is love oriented, how- 
ever, seems less destructive in its impact on 
the development of resistance: to temptation, 
There are several reasons why such tech- 
niques are less counterproductive than phys- 
ical discipline and why they are found to be 
positive in some studies, While physical 
punishment is usually accompanied by pa- 
rental expressions of anger, encouraging 
Similar expressions in the child, love-with- 
drawal techniques are associated with anger 
inhibition ( Hoffman, 1970) in the child ; the 
experiencing of anger by the child suggests 
that emotional arousal is being attributed to 
confrontation with the parent. Addition- 
ally, the love-withdrawal techniques tend to 
leave the child in a state of emotional dis- 
comfort for longer periods, with the termi- 
nation of such punishment often dependent 
on some action or statement by the child 
that relates to the child's initial transgres- 
sion. Expressions such as "I'm sorry that 
I did that" are emotional attribution state- 
ments frequently encouraged by parents in 
psychological punishment situations prior to 
the termination of the punishment. That 
love-oriented techniques have sometimes 
been found to be most effective when prac- 
ticed by warm parents (ie, Sears et al, 
1957) may be due to the child's finding it 
more difficult to attribute his discomfort to 
the warm parents’ behavior, rather than to 
his own. The fact that with greater warmth, 
there is more love to withdraw in the pun- 
ishment situation is also relevant. 
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Due to the unfamiliarity and associated 
uncertainty that the young subject experi- 
ences in the laboratory, combined with 
(usually, in our view) insufficient assurances 
of the undetectability of deviance, the lab- 
oratory provides neither a setting in which 
the exclusive attribution of arousal to pun- 
ishment is likely nor one in which feelings 
of total freedom from detection are likely. 
In laboratory studies the punished child 
therefore usually appears more selíf-con- 
trolled than the nonpunished control. Un- 
fortunately, a complete resolution of this 
point is not possible by reference to the mo- 
rality literature, since researchers seldom at- 
tempt to provide evidence that their children 
were convinced that their transgressions 
could not be detected. Given that the labo- 
ratory is often quite unfamiliar to the re- 
search subjects, for young children to be 
secure in their belief of undetectability con- 
siderable effort should be devoted to the 
staging of experimental situations; if pos- 
sible, the impossiblility of detection should 
be articulated to the child (as in Study 2). 

Naturalistic settings often permit the child 
to make causal attributions that negate the 
effect of punishment and lead to freedom 
from detection. In a home or classroom 
the harshly or frequently punished child has 
repeated opportunities to make accurate at- 
tributions that the negative emotions he ex- 
periences are due to confrontation and pun- 
ishment rather than transgression and to be- 
come certain about when detection is pos- 
sible. Those accurate emotional attributions 
facilitated by repeated experience in natural 
settings are not conducive to internalization, 
With the development of accurate attribu- 
tions about the source of negative emotional 
arousal, either of two types of results may 
obtain (probably most such situations result 
in some of each of these two processes), 
The first process has been emphasized by 
those social learning theorists who have been 
heavily influenced by the traditional avoid- 
ance model exemplified by Mowrer's (1950) 
work, but who must explain the data that 
indicate the ineffectiveness of punishment in 
natural settings. Simply stated, that view 
suggests that the punished child learns accu- 
rate discriminatory cues so that no arousal 


results during transgressions that occur !n 
the absence of punishing agents. While it 1s 
apparent that such a process may take place, 
we would argue that there are a great many 
temptation situations for normal individuals 
in which arousal is elicited even though the 
individual is fully aware that detection is 
impossible; both the cheating studies and 
our child studies provided situations de- 
signed to make detection appear impossible, 
yet arousal and the causal attributions made 
about that arousal played a key role in deter- 
mining the outcome. 

We seek what we perceive to be a neces- 
sary middle ground between older theories, 
which emphasize automatic conditioning pro- 
cedures to the exclusion of cognitive proc- 
esses, and more modern approaches, which 
emphasize total cognitive control over the 
learning of contingencies. The older view 
(Mowrer, 1950), suggesting a classical-con- 
ditioning-like mechanism to account for the 
attachment of negative emotions to situations 
and behavior, could not account for data 
such as that offered in the cheating studies 
or the child studies, in which cognitive inter- 
pretations given to subjects relevant to the 
meaning of their emotional arousal had im- 
portant effects. More modern approaches, 
such as that of Bindra (1974), suggest that 
reinforcing or punishing events do not di- 
rectly affect the strength of the preceding 
response ; rather what is learned are the 

contingent. relations” existing between the 
events of the situation, Bandura (1969, p. 
553) suggests a similar relationship in stat- 
ing that the classical-conditioning-like aver- 
Sive techniques used in behavior modifica- 
tion procedures affect the client only to the 
extent that he is consciously willing to recall 
and revisualize or reexperience the therapy 
Situation when faced with temptation in later 
Outside situations, 

Our view suggests a greater degree of 
autonomy between cognition and emotion, 
With cognition influenced by, but not under 
the total control of emotional arousal, and 
emotional arousal likewise influenced by, but 
not completely controlled by cognition inter- 
pretations. Thus the cognitive knowledge 
Possessed by particularly the very young 
child that his emotional upset in the face of 


p 


EMOTION-ATTRIBUTION 


temptation stems directly from previous 
punishment may not eliminate that emotion 
when detection is thought to be impossible; 
rather, such a causal attribution will make 
the emotion irrelevant and will allow the 
child to act somewhat as if the emotion 
were not present. To illustrate this cogni- 
tion-emotion interaction, consider behavior 
in the face of a mild phobia. The fear ex- 
perienced by an acrophobic adult while 
climbing an open stairway may be as in- 
tense when the fear is cognitively perceived 
to be unjustified (the stairs are safe) as 
when it is perceived to be justified (the 
stairs may break); yet through moderate 
fear levels, such cognitions will determine 
whether the individual will override his emo- 
tional arousal and continue to climb. We 
present the acrophobia illustration as more 
than a metaphor and suggest that many 
of the more important moral prohibitions 
learned by individuals with well-developed 
consciences have a phobialike character, with 
emotional arousal elicited even though the 
tempted individual may cognitively appraise 
the situation as perfectly safe from detec- 
tion, Such an appraisal allows the emotion 
to be overridden without being eliminated if 
the individual attributes his arousal to a fear 
of detection or to other causes that may be 
irrelevant if detection is impossible.* 
Recent research by Loftis and Ross 
(1974) suggests the possibility of a second 
effect resulting from different causal attribu- 
tions. After the acquisition of an emotional 
response, the extinction rate for that re- 
sponse may depend in part on the source to 
which the individual retrospectively attrib- 
utes his emotional arousal. In their re- 
search, a tone was heard while a shock- 
induced emotional reaction, as measured by 
galvanic skin response (GSR), was condi- 
tioned to a light. When later instructions 


8 An extreme example of an apparently similar 


process is found in the work of Fenz and Epstein 
(Epstein, 1967) with skydivers. Although emo- 
tional arousal as indicated by physiological moni- 
toring remains high as the skydiver gains experi- 
ence, his view of the sport as not posing objective 
danger allows the enthusiast to jump. The ea 
ing attributions fostered by experience allow the 
experience of intense excitement where previously 


fear had dominated. 
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indicated that the emotional arousal could 
be attributed to the tone, extinction of the 
GSR was faster (compared to a control 
group for which the tone was not described 
as arousing), even though no tone was used 
during extinction. Thus, although our 
view emphasizes that emotional arousal will 
have differential effects depending on the 
causal attributions made, we do not exclude 
the process of different attributions eventu- 
ally affecting arousal level. With respect to 
moral behavior, the two processes should 
tend to influence responses similarly, with 
attributions that arousal is due to irrelevant 
sources (such as fear of punishment when 
detection is not possible) both negating the 
effect of the arousal and, with repeated ex- 
perience, leading to a reduction in emo- 
tional arousal as well. à 
Techniques such as induction (Hoffman, 
1970), which do not depend on punishment, 
will undoubtedly result in less negative emo- 
tional arousal associated with transgression 
than techniques based on punishment; but 
successful induction, involving the explana- 
tion to the child of why certain behaviors 
are right or wrong, focuses attention on the 
behavior per se, so that the arousal that is 
present during such socialization episodes 
will be attributed to the act of transgression 
rather than to the threat of punishment. 
Hoffman's review indicates that induction is 
more effective in moral socialization than 
punishment-based techniques, even includ- 
ing love-oriented punishment. Our empha- 
sis on the necessity of arousal for induction 
to be effective is relevant to Aronfreed’s 
(1968, p. 9) suggestion that "available evi- 
dence usually points to great discrepancies 
between children's verbal expression of eval- 
uative standards and their actual behavior in 
a real social context." The data of Henshel 
(1971), indicating increasing correlations 
(from fourth to seventh grade) between 
honesty assessed verbally and resistance to 
temptation in a cheating task, provide the 
basis for an explanation of the discontinuity 
between verbal expression and moral be- 
havior noted by Aronfreed. Henshel sug- 
gested that behavior is not dependent as 
much on simply "knowing" what is right as 
it is on “feeling what is ‘desirable. ” Unless 
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standards become associated with emotional 
responses, children will be as free to violate 
these standards as were the subjects in the 


cheating studies who could attribute their 
arousal to a pill. 


Guilt and resistance to temptation, 


A sec- 
ond dilemma from the Social learnin 


g liter- 


es) and resistance to 
temptation, some studies have failed to show 
such a correspondence (Allinsmith, 1960; 
Burton, Maccoby, & Allinsmith, 1961; Mac- 
coby, 1959). 

Some authors (e.g., Hoffman, 1970) 
have Suggested that one of the major prob- 
lems with that literature is that researchers 
endeavored to measure guilt in 


preting 
1 cited, 
at explana- 


Projective measures of guilt is ofter 
Our approach lends support to th. 


ame trans- 
he cheating 
ach emphasizes 
y on immediate 


gression, 
and child research, our appro 
the high degree of dependence 
situational cues jn attributing Causation of 
emotional arousal. Thus it is quite likely 
that individuals who make causal attribu- 


; of arousal 
(guilt) in one situation will find that either 


ake more extern 
attributions concerning emotional causality 


(shame or fear of punishment) in other 


- Significant correla- 
tions between guilt and resi 


study that measured guilt 
temptation with respect to 
transgression found that 
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strong negative external inducement ( threat) 
has been demonstrated by researchers in- 
vestigating a hypothesis derived from cogni- 
tive dissonance theory. That hypothesis 
stated that children who receive a mild 
threat to inhibit an act will take more per- 
sonal responsibility for resisting temptation 
than will severely threatened children; they 
will therefore experience more cognitive dis- 
Sonance over resisting temptation. Aronson 
and Carlsmith (1963) and Freedman 
(1965) demonstrated that when children 
were given a mild threat to not play with a 
valued toy, they valued the forbidden toy 
less (Aronson & Carlsmith) and played 
with it less (Freedman) than was the case 
under severe threat, The sophisticated con- 
trol conditions of the Freedman study (find- 
ing no threat-effect differences when there 
was no Opportunity to play with the toy 
during the threat-relevant period) allowed 
the elimination of many potential explana- 


tions alternative to the dissonance-derived 
hypothesis, 


A more recent liter: 
indicating similar effe 
Strong positive 


ature has developed 
cts for unnecessarily 
inducements ( reinforce. 
ing of deCharmg (1968) 
of perceived locus of con- 
led to rese 


ct by Deci (1971, 1972), 
» and Lewis 
and Nisbett (1973), This 
an individual 
a task he would 
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about the internal cause of positive and 
negative emotions. But by focusing as it 
does on the ability to change causal attribu- 
tions of emotion associated with and resulting 
from behavior after the behavior has been 
performed, our approach suggests that rein- 
forcement or punishment given after the fact 
will often create effects similar to induce- 
ments and threats made before the fact. This 
conclusion from our studies with children is 
reinforced by the findings on "retrospective 
misattribution" by Loftis and Ross (1974) 
cited earlier. The importance of this exten- 
tion becomes more obvious when we con- 
sider that socialization efforts are usually 
made after some motivational failure has 
occurred. 

As suggested earlier, the extent to which 
punishment or reinforcement draws atten- 
tion to itself rather than to the preceding be- 
havior will be influenced not only by the 
specific form of punishment or reinforce- 
ment but also by the style of administration 
and the situation. Thus while tangible rein- 
forcers, such as physical punishment, are 
likely to draw the attribution of emotional 
response toward themselves rather than to 
the preceding behavior, certain approaches 
may accentuate that process. Information 
that suggests that the administered rein- 
forcement should be sufficient compensation 
and an adequate inducement for future per- 
formance should increase the likelihood that 
pleasure derived from the task itself (from 
the means, as well as from the completion) 
will be attributed instead to the reinforce- 
ment. 

This approach may be useful in illuminat- 
ing an apparent contradiction in the re- 
cent overjustification literature. Unexpected 
prizes administered to a group of game-win- 
ning students by Kruglanski et al. (1972) 
had the effect of reducing liking for those 
competitive games (compared to nonre- 
warded winners), while children who re- 
ceived unexpected prizes for good drawing 
in the Lepper et al. (1973) study did not 
later behave as if their prizes caused them 
to devalue the special drawing task used. 


From our perspective, the critical difference 


lies in the changed attributions resulting 
from Kruglanski et al. telling their subjects 


that they had been promised the rewards 
prior to the games (which in fact was not 
true) and in giving tangible reinforcers 
(plastic puzzles), which were salient during 
the game ratings because they were given 
before the rating. The positive emotional 
response resulting from playing and win- 
ning the games was apparently partially at- 
tributed to the prizes. In the Lepper et al. 
study, however, the unexpected reward 
group was not misled to believe that their 
rewards had been previously promised nor 
were those rewards salient at the time the 
dependent measure was taken; their awards 
had been posted (several days previously) 
on an “award board” that was not present 
in the classroom in which the dependent 
measure was taken. 

Our approach leads to a theoretical inte- 
gration between findings associated with 
achievement motivation and the overjustifi- 
cation research, an integration very similar 
to that which is apparent between the pun- 
ishment literature (in natural settings) and 
the dissonance literature. Research in the 
achievement area indicates that while ges- 
tures of parental love and affection (non- 
tangible reinforcers that do not draw atten- 
tion from the child’s behavior) are related 
to high achievement motivation in boys, the 
administration of tangible reinforcers by 
parents is not associated with high achieve- 
ment motivation (Rosen & D'Andrade, 
1959; Spence, 1970; Winterbottom, cited in 


McClelland, Atkinson, Clark & Lowell, 
1953). 
Tue COGNITIVE AND AVOIDANCE THEORY 
INTERFACE 


Cognitive development theories of mor- 
ality, as exemplified by the writing of Piaget 
and Kohlberg, are too extensive to be re- 
viewed here in detail. Rather, cognitive de- 
velopmental theory will be briefly outlined 
and the relevance of that approach to the 
proposed theory will be explicated at a 
general level. 

Piaget (1932) postulated that the devel- 
opment of the child's moral judgments cor- 
responds to the development of other more 
general cognitive capacities. Prior to the 
development of sufficient cognitive flexibility, 
the child makes moral judgments in strictest 
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conformity to established rules, viewing be- 
havior as totally right or wrong, basing 
that assessment on the outcome of an act 
regardless of the intent of the actor, and 
believing in the concept of “immanent jus- 
tice"—that punishment will automatically 
and surely follow transgression. With ma- 
turity and cognitive flexibility, moral judg- 
ments become more flexible, with rules seen 
as changeable depending on other constraints 
of the situation; the outcome of a specific 
act has less impact on the assignment of 
blame than the intent of the actor. In part, 
this transition to flexibility is made possible 
through active participation in society and 
through rule making and changing, allowing 
rules to be seen as serving man's needs. So- 
cial Participation also encourages the acqui- 
sition of increased empathy through inter- 
action with peers, 

Kohlberg (1958) extended Piaget’s ap- 
proach to a 6-stage sequence, each stage of 
which is defined by a different basis for 
moral judgment. In earliest stages, avoid- 
ance of punishment (Stage 1) and later the 

efinition of right and wrong based on the 
valence of the outcome for the actor (Stage 

Prevail. Middle stages involve judg- 
ments based on the maintenance of social 
approval (Stage 3) and then on duty to laws 
and the Prevailing social order (Stage 4). 
Higher Stages are based on an understanding 


a p^ Social-contract nature of social and 
€gal systems and respect for the rights of 
others (Stage j ne 


5) and finally a recognition 


nd moral principles i 
] are often in 
conflict, resolve em 


relationship of moral behavior to moral rea- 
Soning, or of the relation 


l ship of affect an 
emotional arousal to moral : n. 


2j however, 


nsgr ion. Thus fora 
guilt over violation of 
would result; for the 


Stage 4 individual, guilt would imply “con- 
cern about one’s responsibility according to 
rules"; for the Stage 3 individual, only 
shame would be experienced. 

Kohlberg's astute analysis is completely 
compatible with our orientation, but it is 
through the emotion-attribution perspective 
that we can address the issue of the dy- 
namics that account for this presumed usual 
compatibility of affect and moral judgment 
in the maturing individual. The key to our 
explanation is that temptation situations are 
basically unchanging as the individual ma- 
tures through the cognitive stages postulated 
by the cognitive-developmental theorists. 
That is, even though perceived somewhat 
differently with maturity, and even though 
different responses become available, the ba- 
Sic issues that lead to the emotional arousal 
associated with temptation remain essentially 
the same. It is, of course, impossible to 
prove this point, but it is difficult to name 
any moral issue faced by a mature adult in 
any sphere of personal or professional ac- 
tivity that does not have an obvious and ba- 
Sic counterpart for a 6-year-old child. There 
are mature and immature forms of stealing, 
of being dishonest or disloyal, of hurting 
others or being unfair, or of being destruc- 
tive to property and life. This view is sup- 
ported by the moral dilemmas used in the 
Kohlberg (1958) system to assess maturity 
of moral judgment—the situations can be 
seen as posing ethical dilemmas to individ- 
uals at any stage of moral maturity. 

How does this continuity of moral dilem- 
mas provide a mechanism for understanding 
the (normally) continuing compatability of 
moral judgment and its supportive affect as 
the individual’s level of moral maturity 
changes? A basic tenet of our approach is 
that many moral dilemmas evoke a phobia- 
like response, causing emotional discomfort 
as the individual begins to seriously consider 
transgression as an option. But that emo- 
tional arousal will have different meanings 
(and will affect behavior differently) for in- 
dividuals at different levels of moral develop- 
ment. Thus one’s Stage of moral develop- 
ment does not determine whether an indi- 
vidual will experience arousal, but rather 
Once such arousal js €xperienced, one’s stage 
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of moral maturity will influence what the 
arousal means and therefore whether the 
arousal is relevant to the available behavioral 
options. We do not wish to exclude the 
possibility that the level of maturity will in- 
fluence one's perception of the situation and 
that this perception will influence the amount 
of arousal experienced, but the impact of 
such initial perceptions on arousal level may 
be minimal. 

As an example, we might consider the 
problem of killing others in a war, assuming 
a society in which parental instruction, the 
major institutions of society, and the child's 
personal encounters with death all create in 
the child a negatively toned emotional arousal 
at the prospect of causing another's death. 
Other factors that might support this arousal 
include the recognition of the extraordinary 
exercise of power in killing, the irreversibil- 
ity of death, and the degree to which the be- 
havior is relevant to the support of or the 
negation of self-concept. When ordered to 
kill in war time, individuals at different 
stages in Kohlberg's (1958) moral develop- 
ment scheme would act differently. The 
Stage 4 individual, believing firmly in the 
supremacy of the law and the absolute need 
to uphold the law, would agree to kill, yet 
he would experience inhibiting negative emo- 
tional arousal (based on processes such as 
those discussed earlier), which cannot be 
eliminated readily through cognitive control. 
Unless overwhelmed with negative emotion, 
his attribution of affect to a fear of killing 
would be sufficient to make that negative af- 
fect less relevant, allowing him to kill (just 
as the shame-oriented child is able to trans- 
gress even when aroused if detection seems 
impossible, or as cheating is facilitated for 
the individual who can attribute his arousal 
to a placebo). With killing experience, the 
Stage 4 soldier’s emotional response should 
decline relatively rapidly (since, as indicated 
by Loftis and Ross, 1974, the attribution of 
arousal to an na source may facilitate 

uent extinction). 
dials at Kohlberg’s Stage 6 would 
initially experience arousal for the same rea- 
nd in a manner similar to the Stage 4 

But principles based on indi- 
cience allow more latitude of out- 
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individual. 


vidual cons 
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come. One may hierarchically arrange per- 
sonal values so that killing is required by 
conscience to defend principles more im- 
portant that the deaths involved, or one may 
decide that killing cannot be justified by the 
principles defended by the war. In the first 
instance, the arousal would be experienced 
as guilt (the individual assumes personal re- 
sponsibility for the decision) over the antici- 
pation of killing, but the attribution of that 
arousal to the violation of moral principles 
would not prevent the killing—the arousal 
would be of little relevance since other moral 
principles associated with the nominal pur- 
pose of the war have become more important. 

The Stage 6 individual who has concluded 
that life is more important than those princi- 
ples defended by the military action, on the 
other hand, experiencing his arousal as an- 
ticipatory guilt over the killing, would focus 
on his arousal as relevant to that behavior. 
"Therefore, the arousal could be experienced 
as a manifestation of the degree of opposi- 
tion the individual feels toward killing and 
as evidence of the correctness of the decision 
not to kill. As the three hypothetical indi- 
viduals approach their battlefield decisions, 
increasing arousal will therefore have differ- 
ent impacts on each. The Stage 4 indi- 
vidual, feeling his arousal as unwelcome fear, 
might begin to feel cowardly and ashamed 
of his emotional reaction. The first Stage 6 
individual, feeling anticipatory guilt, would 
feel more confused, wondering if this in- 
creased arousal should be taken as evidence 
of the wrongness of killing and hence of a 
need for the reordering of those values that 
favored killing. The second Stage 6 indi- 
vidual would, of course, become more con- 
vinced of the correctness of his decision not 
to kill. 

Emotional arousal and cognition are thus 
seen to interact in a complex manner, with 
behavioral outcome determined by the out- 
come of that interaction. When either is 
overwhelming, it may tend to dominate de- 
spite the counterinfluence of the other, but 
in the normal middle ground of life, that 
mechanism which permits us to display our 
humanity (when humanity is defined in the 
old-fashioned manner as the dominance of 
the rational over the appetitive and emo- 
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tional) seems to depend heavily on our abil- 
ity to experience our emotion as inoperative 
by attributing it to causes not relevant to the 
present situation. Tomkins (1963) wrote 
that it is "one of the major functions of any 
negative affect theory to guide action so that 
negative affect is not experienced,” and 
that this represented “affect at a distance” 
(pp. 320, 321). We would agree, but add 
that the major function of any theory that 
seeks to integrate cognition, affect, and be- 
havior must provide an explanation of how 
cognition, more differentiated and complex 
than affective responses, may moderate the 
impact of emotional arousal on behavior, 
even while failing to completely dominate or 
eliminate these affective responses. 
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A Testable Property of Some Discrete-State Models 
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Several current theories of recognition mem 
discrete states of memory with the assumpti 


familiarity or apparent oldness. 


memory models imply a lower bound on 


confidence ratings in yes-no recognition experiments. 
is derived that new items cannot be perfectly reco 


forgetting probability is zero. E 
this prediction to be wrong. 


Some years ago, Bernbach (1967) and 
Kintsch (1967) proposed models for recog- 
nition memory that combined the assumption 
of only a few discrete states of memory with 
that of continuous distributions of "apparent 
oldness" or "familiarity," on which a decision 
process operates. These models enjoy some 
popularity, particularly since some of the more 
comprehensive theories of memory, such as 
those of Bernbach (1970), Kintsch (1970b), 
and Anderson and Bower (1972), reduce to 
them under suitable conditions, On the other 
hand, it has been repeatedly claimed (e.g., 
Kintsch, 1970a; Murdock, 1974) that high- 
threshold models of recognition memory are 
rejected by the known experimental findings. 
These claims seemingly were reinforced by 
Krantz's (1969) article, which presents an 
important empirical test for threshold theories 
of signal detection. What seems to have been 
overlooked in recent research on recognition 
memory is that both the Bernbach and the 
Kintsch models can be regarded as generalized 
high-threshold models and consequently can be 
subjected to the same critical test. The pur- 
pu 
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ory combine the notion of a few 
on of continuous distributions of 


If a test for threshold theories of signal de- 
tection proposed by Krantz is applied, it can be shown that the dis 


crete-state 
the a posteriori curve obtained from 

From this the prediction 
gnized as new unless the 


xperimental data are presented that show 


pose of this article is to show how Krantz's 
test can be applied to examine the discrete- 
state assumption of the models and to review 
some critical experimental data. 
3 A yes-no recognition experiment can be 
imagined in which old and new items are pre- 
sented singly and the subject's task is to 
evaluate each item as old or new. In addition 
to the "yes, old" or “no, new" decision, the 
subject is required to give a rating indicating 
the degree of confidence in his response. We 
will concentrate on the so-called a posteriori 
curve, which plots the probability that a test 
ten was in fact old for each rating category. 
ye assume the rating scale to be ordered, 
ud br re confidence that the item 
iw. TF e hest confidence that the item was 
» eds subject's use of the rating scale 
seram sd. m monotonically increasing 9 
ende e denis is expected; the probability 
wale Ge ^ m old should start from sma 
Sel bs -confidence new responses a^ 
responses, "This is t tor Highenifience ol 
oe is is what is usually found 2”) 
theor 2 accounted for by any reasonable 
Wake tan recognition memory. If the discrete 
iem ig i holds, the a posteriori curve 
values n lower bound, below which n? 
That is, the all except for sampling errory, 
endi he probability that an item was ales 
given that the subject responded “new,” can- 
not generally attain values of 0, even for high- 
confidence responses. Stated differently, 2°” 
cording to these models subjects cannot recog” 


nize ite 
new items perfectly unless they have 
perfect memory. 
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In the following discussion, we derive the 
lower bound on the a posteriori curve predicted 
by the Bernbach (1967) and the Kintsch (1967) 
models and give an equivalent lower bound on 
the slope of the receiver operating charac- 
teristic (ROC) curve. First, we give a brief 
statement of the models. In order to avoid 
redundancy, we then derive the lower bound 
using a more general model. In the final section, 
the models’ predictions are compared to data. 


PROPOSED DISCRETE-STATES MODELS 


Bernbach’s (1967) model assumes that an 
item can be in one of two states: a memory 
state (R) and a forgotten or unlearned state 
(N). All new items are initially in state N. 
Old items at the time of testing are either in 
state R, with probability 1 — à ,or in state N, 
with probability à, where 6 denotes the for- 
getting probability. Attached to each state is 
a distribution of apparent oldness, with den- 
sities fr(x) and fy(x), respectively. As a 
consequence of the assumptions, the distri- 
bution for new items is given by fy (x), whereas 
that for old items is a mixture of the two dis- 
tributions fr(w) and fy(x). The model for 
recognition memory proposed by Anderson and 
Bower (1972), although derived from different 
assumptions, is formally equivalent to Bern- 
bach's model. 

Kintsch's (1967) model is more complicated. 
It has three memory states and three distri- 
butions of familiarity, but there is no one-to- 
one correspondence between them. Rather, 
the memory states are defined by intervals on 
the familiarity scale. Since we are mainly 
concerned with the decision aspects and not 
the learning process as described by this model, 
we shall simply ignore the memory states in 
this article. 

According to Kintsch's model, all new items 
have familiarities determined by a distribution 
y(x). After presentation of an item, it is 
assigned a new familiarity value. If its initial 
familiarity was less than a critical value £, its 
new familiarity is determined by a distribution 
o (x); if it was larger than £, the new distribu- 
tion will be $'(x), where $' (x) is assumed to 
equal $ (x) truncated at £. An item may be 
forgotten with probability fif its familiarity 
was less than a second critical value ^ > &, 
where forgetting means reassignment of a 
familiarity value according to the initial dis- 
tribution y (x)- Summarizing pe b be 
we have a similar picture to that in Bernbac : E] 
model: New items follow a distribution (x), 


whereas the distribution of old items is a 
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mixture of the three distributions W(x), (x) 
and $' (x). 

. Let us now introduce a more general nota- 
tion, which encompasses both models. Any 
given item can be in one of three states, Dj, 
j =1, 2, 3. All new items are in state D1; 
old items are either in state Di, Ds, or Ds. For 
brevity, we let d; stand for the probability that 
an old item is in state Dj, P(Dj|old). Since 
any old item is in exactly one state, Ed; = 1; 
for Bernbach's model d; = 0, as there exist 
only two states in this model. 

Let the confidence ratings be numbered from _ 
Ri to Rn, where low numbers indicate high- 
confidence old responses. In order to derive 
the a posteriori curve, it is necessary to specify 
how the confidence rating given to an item is 
determined by the item's state. Both Bern- 
bach's and Kintsch's models handle this by 
attaching continuous distributions to the states 
and by assuming that the subject adopts a set 
of cutoffs that divide the underlying scale into 
intervals; the subject responds with Rẹ if and 
only if the oldness or familiarity of the item 
presented falls into inverval I+. Since the 
property to be derived holds for any underlying 
distribution and any response rule, we do not 
follow this approach. Instead, we simply de- 
fine a set of conditional probabilities P(Ri|Dj); 
which gives the probability of responding with 
Rif the item is in state D; for all & and j. The 
only assumption necessary is that the prob- 
abilities of the various ratings depend only on 
the item's state. These conditional prob- 
abilities can be thought of as derived from the 
assumption of underlying distributions and 
multiple cutoffs, or they can simply be re- 
garded as free response bias parameters, which 
are under the subject's control. For brevity, 
we let rz; stand for P (R| D;); since exactly one 
response is given, we have 


ng = 1, for j= 1,2,3. 
k 


If we let O and N stand for the events that a 
test item is old or new, respectively, the a 
posteriori curve is then defined as P(O|R:), 
k = 1, n; that is, the probability that the item 
is old given the subject's response is Ry. From 
Bayes' theorem we have 


P(O|Rx) 
P (R«|0)P (0) 


= PQuIO)P(0) + PQu|N)PQU' 


where P(O) and P(N) denote the experi- 
mentally controlled probability that a test item 


(1) 
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is old or new, respectively. According to the choice of the r;;, or equivalently, any possible 


assumptions response assumption, in particular for those of 
P $ Bernbach (1967) and Kintsch (1967). 
P(R4JO) = X P(R4|Dj)P(D;|O) The parameter d; is essentially the forgetting 


jai probability in the models, since all new items 


are initially in state Dı and old items can be 
in state Dı only if they returned from D» or 
Ds, that is, are forgotten. For Bernbach's 
model, we immediately see that dı = P(Di1|O) 
— à, the forgetting parameter. Thus, this 
model implies that perfect recognition of new 
items as new, that is, P(O|R,) = 0 for high- ! 
confidence new ratings Ry, occurs only if there 
is no forgetting. 


radi + rred2 + risds, 


since any old item is in one of the three states, 
and the probability of using a given rating 
depends only upon the item's state. For new 
items we have 


P(R4N) = P(R;l| Di) = ra, 


since all new items are in state Di. If we 
insert this into Equation 1, the result is 


For Kintsch's model, d; is derived as follows: 
P(0|R;) 


£ denotes the probability that the familiarity 
of a new item exceeds £, and a X c and c 
denote the probabilities that an item in state 
D» or Ds, respectively, exceeds A. Since 
forgetting, that is, return to state Dı, occurs 
with probability f if the item's familiarity 


(ready + reads + reads) P(O) 
(radi + rrodo + reads) P (0) + ri P (NY 


If we replace P(N) with 1 — P(O) and divide! 
every term by ria, we find 


was less than A, it follows that dj = (1— g) ; 
P(0|R;) X(ü-—aXof-Fs(-of This expres- ; 
ko, y k3 sion equals zero if f = 0 or if g = 1, c = 1, or 
hP (0)-+ (a4 a) P(0 “oneg ue eo kas hg 
s ROE fkl Ean 3)P(0) c = 1, a = 1, which means that either there is 
E e " : no forgetting, it trans 
asP(0)+1—P(0)+ (“ast as) P(0) getting, or new items are transferred to 
" " 


a perfect memory state on the first presenta- 
ton. The model thus makes the same pre- 
diction as Bernbach's: Perfect recognition of 
new items as new occurs onl 
is possible. 


(2) 
It is straightforward to prove that the right 
side of Equation 2 is at least as large as 


dıP (O)/[d1P (0) + 1 — P(O)], since the term 


ree 


y if no forgetting 


There are two ways of testing the lower 
bound predicted by t 


Tk3 
h+” d) P(0) 


ie he discrete-state models. 

E ne is to E whether empirical a posteriori 
: f curves reach the zero ley iti 

is nonnegative. Thus, the final result is where th e ander ace 


L0 dP(O 
diP(0) + 1 — P(0) 
if di P(0) #0. (3) 


e assumption of no forgetting can be 
aum | on other grounds. This will be demon- 
ated in the next section. It should be noted 
that failure to reject the discrete-state models 


does not lead i 
l ad to their acceptance, since con- 
Equation 3 shows that the lower bound on tinuous models such as strength theory (e.g. 


the a posteriori curve is determined by the Norman & Wickel $ 
probability that an old item is in state Dy, dy, bounded a Puno n. kapky ior 
à E EA. dice he S ` z ENTS aS wel Aiso, V 
ad the vire Ws het à SIN, test Them, is WS be thar The obtained curves do mot 
AM DQ. Ses. VAD) 16 SORS Ny. "i 
WW) Sw V 


SR. S "rivos Sangi 
yensonabie tec NO, Wenay VR, it 
follows that, according to the discrete-state 
models, the a posteriori curve can attain zero 
only if dı = 0, that is, old items are never in 
state Di. It should be noted that no assump- 
tion about the rating probabilities rj; was 


w) 
P(0)1R)) > pay 


>0 


ESSE ES 

NS WS, tenet in, Whe eres, Sos iN 
Que to too few rating categories or insufnàent 
Motivation of the subjects to use the highest 
rating categories properly. 


The second possibility of testing the dis- 


needed to derive Equation 3. Therefore, the 
bound given in Equation 3 holds for any 


1 We are assuming ta # 0. If rm = 0, P(O|R)) 
= 1, which is consistent with the bound given in 
Equation 3. 


crete-states assumption is to use independent 
estimates of the forgetting probability, di, 
derive a quantitative prediction for the asymp- 


tote of the a Posteriori curve, and compare 
the prediction to data. For example, we 
know from short-term memory experiments 


employing the Peterson and Peterson (1959) 
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paradigm that a single consonant trigram 
(CCC) is recalled after 18 sec with a prob- 
ability of about .2 (e.g., Murdock, 1961). If 
the recall rate could be used to estimate the 
recognition forgetting-probability, we predict 
a lower bound of .8 X .5/(.8 X .5 + .5) = .444 
for a corresponding recognition experiment, 
with P(O) — .5. Thus, under these conditions 
we should find that on at least 44% of those 
trials where the subject responded new with 
highest confidence, the test item was in fact 
old, which would be hardly better than chance. 

The problem with this second approach is 
that acceptable independent estimates for the 
recognition forgetting-probability are difficult 
to find. According to the original version of 
Bernbach's (1967) model, all items in state R 
will be correctly recalled, so that equating 
recall and recognition forgetting-rates would 
be defensible except when guessing occurs 
during recall. However, in a more recent 
version of the model, Bernbach and Kupchak 
(1972) proposed that the recall probability 
equals the probability of an old item being in 
state R times a multiplier less than one, which 
can be interpreted as a retrieval probability. 
Similarly, the dual-process theory of recall and 
recognition (Kintsch 1970b; Müller, 1913) 
maintains that recall involves item retrieval 
and an implicit recognition decision, whereas 
recognition involves only the latter. Thus, 
according to these models, the recall forgetting- 
rate underestimates the true recognition for- 
getting-probability by an unknown amount. 
Because we see no easy way of getting quanti- 
tative estimates of the forgetting probability 
for the discrete models from independent recog- 
nition data, we will not use this second 
approach. 

In order to apply the test to experimental 
data, we need information about the a pos- 
teriori curve characterizing the subject’s recog- 
nition performance. However, many authors 
publish their data only in the form of ROC 
curves computed from the rating frequencies 
(see Green & Swets, 1966, for the construction 
of rating operating characteristics). Since the 
ROC and the a posteriori curve are but two 
ways of looking at the same data, it should 
come as no surprise that a property of the ROC 
can be derived that is analogous to the lower 
bound on the a posteriori curve predicted by 
the discrete-state models. We shall now show 
that the minimum slope of the ROC cannot 
be less than the forgetting probability, di, if 
the discrete-state assumption holds. 


The hit rate, H+, and the false-alarm rate, 
F;, corresponding to the &th point on the 
ROC, are found by cumulating the first k 
responses as "yes." Using the assumptions 
stated above, we find 


k 
Hy = 2; P(R;|0) 


j=l 


k k k 
— d, ra d- ds 35 ret dD ris 
1 1 i 


k k 

F; = 25 P(R;|N) = 2 Tn. 
jel 1 

For the slope of the straight line connecting 

the & — 1 and the &th point of the ROC curve, 

sp, it thus follows that 


Hy — Be 
Sk = * E = (diri + dario + daria) /ria 


= dy + deris/riy + daris/ria. 


Since the last two terms on the right are non- 
negative, the following inequality holds for 
k=1ton: 


Sk zt di. (4) 


Thus, an alternative way of testing the 
discrete-state assumption is to examine the 
minimum slope of the ROC curve. If it goes 
to zero, this implies that dı = 0, that is, there 
must have been perfect retention according to 
the discrete state models. A simple graphical 
check is to see whether the ROC curve touches 
the ceiling of the unit square, that is, whether 
any point lies on the horizontal line connecting 
the points (0, 1) and (1, 1). If this occurs, the 
line connecting this point to the zth point of 
the ROC, which equals (1, 1), must have slope 
$5 — 0. Of course, this finding has the same 
implication as the a posteriori curve extending 
to zero. 


Some EMPIRICAL RESULTS 


We have searched the recognition-memory 
literature for articles on experiments with 
yes-no designs with confidence ratings that 
report a posteriori curves or ROC curves for 
individual subjects. From the seven articles 
we could locate, one (Wickelgren & Norman, 
1966) could not be used, because the authors 
eliminated all points of the ROC curves with 
hit rates above .98, and these, of course, are 
the crucial data for a test of the discrete-states 
assumption. All but one (Norman & Wickel- 
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FIGURE 1. A posteriori curves 


(Section a shows the curves for the 
curve for the combined data.) 


gren, 1969) of the remaining six reports indicate 
some violations of the assumption. 

Kintsch (1968) used a modified Peterson and 
Peterson (1959) procedure. „On every trial, a 
list of five 4-consonant combinations (CCCCs) 
was presented at a rate of 1-1.5 sec per item, 
followed by a distraction task of 20-sec dura- 
tion. On yes-no tests, either an item identical 
to one of the five list items or a new 
presented ; old and new items occurred equally 
often, that is, P(O) — .5. The subjects were 
to give old or new responses and a confide: 


one was 


nce 
rating on a 4-point scale. On 60% of the 
trials, forced-choice tests with two, four, or 


eight alternatives were given. 
were employed. 

The analyzed data are based o 
which items from Positions 1-4 w 
Figure 1 presents the a posteriori 
individual subjects and the data combined 
over subjects. Each curve reaches the zero 
level for the highest-confidence new rating, 
Although a statistical evaluation of the re. 
liability cannot be given, because the number 
of observations on which the estimates arc 
based is not known, it seems evident from the 
shape of the curves that the true asymptote 


Four subjects 


n trials in 
ere tested, 
Curves for 


obtained in Kintsch's (1968) experiment, 
individual subjects (Ss), and Section b the 


of the a posteriori probabilit 


fore, in order to account for these data, the 
Bernbach and the Kintsch model must assume 
a forgetting rate of zero, 

Intuitively, 


ies is zero. There- 


the assumption of zero for- 
&etting-probability does not seem very plau- 
Sible under these conditions. It would imply 
that five CCCCs can be held in memory for 
Sec without any loss of information and that 

is sufficient to learn them per- 

of the serial position effects 
ormance supports this in. 


An analysis 
9n recognition perf 
tuitive argument, 


no and 
the proportions 
ms for Positions 
7, respectively. 
ighly significant 
8, p. 2), mostly 
ance on items in 
cognition memory 
cannot have been 
“State models are 


tions 1-4 
perfect. Thus, the discrete 
Tejected by these data, 

Schmidt and Vor Ç 
Ployed the p mea 


" ote 1) also em. 
A eterson and 
Paradigm with 
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rating category 


Fıcure 2. A posteriori curves obtained in Schmidt and Vorberg's (1969) 


experiment. 


(Section a shows the curves separately for the three retention 


intervals, 13.8, 18.8, and 23.8 sec, and Section b the curve for the combined 


data.) 


a trial, a single 3-digit number was presented, 
followed by a distraction task of 13.8-, 18.8-, 
or 23.8-sec duration. The test was either 
yes-no recognition or recall. Within sessions, 
recognition and recall trials alternated ran- 
domly; on recognition trials, the probability 
of presenting an old item, P(O), was .33. The 
subject gave his response on a 10-point scale; 
the 5 left-hand response categories indicated 
new responses. After sufficient training to 
reach a stable performance level, the subject 
was run for 36 daily sessions, with 86 trials 
each. 

Figure 2 shows the a posteriori curves for 
each retention interval separately and the 
combined data; each curve is based on ap- 
proximately 340 observations. Again, each 
curve asymptotes to zero. On all 585 trials 
on which the subject gave one of the highest 
three new ratings a new item had, in fact, been 
presented. A statistical computation shows 
that the hypothesis of a true a posteriori 
probability of .0051 or higher for these rating 
categories can be rejected on a significance 
level of at least 5%. Since a bound of .0051 
corresponds to a value of di of .010, we can 
safely conclude that according to the discrete- 
state models the forgetting probability must 
have been .010 or less. 

Unfortunately, in this instance we do not 
have independent evidence against the zero 
forgetting-assumption except from recall data. 
The proportion of correct recalls after 13.8, 


18.8, and 23.8 sec was .787, .676, and .556, 
respectively. Thus, there was an appreciable 
amount of forgetting; however, as mentioned 
above, this may merely reflect retrieval diffi- 
culties and does not necessarily imply nonzero 
recognition forgetting-rates. 

Kintsch and Carlson (1967) investigated 
recognition learning in a paired-associates par- 
adigm. Subjects were presented lists con- 
taining either 14, 18, or 22 pairs, with either 
CCCs or CVCs as stimulus members. Only 
two different response alternatives occurred 
within a given list, either 1-digit numbers, 
such as 1 versus 2, or letters, such as A versus 
E. Pairs were presented for 1.5-2 sec, à 
random half of them paired with the correct, 
the remainder with the incorrect response. 
Subjects were to respond yes or 70, depending 
on whether they thought the stimulus was 
paired with its correct alternative or not, and 
to give a confidence rating on a 5.point scale. 
After a response, feedback was provided. 
Learning continued until subjects reached a 
criterion of two successive trials with correct 
response to all pairs. Four subjects were 
employed. 

Figure 2 of the Kintsch and Carlson article 
(1967, p. 894) displays the rating ROC curves 
for recognition performance on the first trial. 
For three of the four subjects, the point 
corresponding to the laxest criterion lies on the 
horizontal line, (0, 1)-(1, 1), indicating a zero 
slope of the ROC curve and thus a forgetting 
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probability of zero. For the fourth subject, it 
comes very close. Due to the large amount of 
data on which the ROC curves are based, this 
result is very reliable. Combining the data for 
the three subjects with zero slope, we find that 
on none of the estimated 146 trials on which 
they used the highest-confidence new rating 
was the correct pair presented. This implies 
that we can reject the hypothesis, d, 2 .020, 
on a significance level of at least 567. 
What evidence exists against the assumption 
of zero forgetting, that is, perfect recognition 
memory on the first trial? Kintsch and Carlson 
do not report explicitly ho: 
performance improved af 
however, a simple calcul 
formance on the first tria 
asymptotic. From Figure 4 of the Kintsch 
and Carlson (1967, 
value of about .18 f 
Portion of incorrect 
the first trial (disregarding confidence ratings). 
If this value were characteristic of the asymp- 
totic recognition performance, it would take a 
subject an expected number of 273.0 trials to 


With an error rate of .18, this leads to 
an expected number of 49,2 errors per item, 


which is to be compared to an observed mean 
number of .73 errors per i by 


Kintsch and Carlson. Thus, it is clear that 
the assumption of asymptotic recognition per- 
formance on the first trial cannot be considered 
seriously, which implies that the forgetting 
probability on the first trial must-h 
appreciably larger than zero. Since the ob- 
tained minimum slopes of the 
contradict the i 


Murdock (1974, p. 27) presents data from a 
single subject from a study on word recognition 
run by Murdock and Dufty (1972). On a 
trial, a list of 15 common two-syllable English 
Words was presented at a rate of .8 Sec/ word, 
followed by a 30-word test list containi 
original 15 words and 15 distractors, 
test word the subject responded on a 
rating scale; the 3-left-hand rating ca 
indicated new responses, 

For the subject, the a 
for the 


6-point 
tegories 


posteriori 
highest-confidence new 
estimated as .036, which correspon 
of di — .037, since P(0) = .5. 
large number of observations, 
reject a value of dı > .048 on 


Probability 
response js 
ds toa value 
Due to the 
these data 


a Significance 
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level of a < .05, Although the a posteriori 
curve does not go down to exactly zero, this 
may well have been due to the few rating 
categories employed, as the steepness of the 
curve suggests (see Figure 2.11, Murdock, 
1974, p. 33). Again, in this instance we do 
not have independent evidence on the true 
forgetting rate; however, we know from com- 
parable studies that there are huge serial- 
position effects that are not apparent here, 
because Murdock’s data are averaged over list 
positions, For example, Shiffrin (1970) reports 
results from an experiment in which a list of 
30 common words presented at a rate of 1 
sec/word was followed 
each pair subjects had to identify the old word. 
The data show 


curve, with primacy and recency parts; the 
Proportions 


In a recent study com 
retention tested by and m-alter- 
native forced-choice recognition, Murdock and 


six syllable word 


test pair w 
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: ponse term, ]f the I 
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.033, respectively, corresponding to forgetting 
probabilities of .011 and .034, respectively. 
The data are stable enough to reject forgetting 
‘rates larger than .029 and .061 for the slow 
and the fast condition, respectively, on a 
significance level of a < .05. 

Is it reasonable to assume that the recog- 
nition forgetting-probabilities for items from 
Positions 5-6 were practically zero? Murdock 
and Wells (1974) present some data that make 
that assumption highly implausible. First, 
after suitable corrections for guessing we find 
a mean correct recognition rate of .895 for 
these items, as estimated from the forced- 
choice data, averaged over subjects, pre- 
sentation rates, and number of alternatives. 
This clearly leaves room for improvement. 
Second, from the recall data we see that items 
from Position 6 were remembered better than 
items from Position 5 (for slow presentation, 
the proportions correct for Positions 5 and 6 
are .836 and .929, respectively; for fast pre- 
sentation the corresponding figures are .830 
and .924). Both differences are significant by 
t tests of the differences of the arc-sine trans- 
formed proportions, ¢(7) = 2.82, £(7) = 2.26. 
In view of the nearly identical average correct 
recall and recognition rates (.881 and .895, 
respectively) we tend to believe that corre- 
sponding serial-position effects must have been 
present in the yes-no recognition performance. 
Further evidence on the completely analogous 
effects of list positions on recall and recognition 
probabilities comes from a study by Bernbach 
and Kupchak (1972). In a comparable ex- 
periment employing lists with five paired- 
associates, they found the usual serial-position 
effects both for recall and yes-no recognition. 
The effects were present both in empirical 
indexes of retention (d' and proportion cor- 
rectly recalled) and in parameter estimates of 
the recognition and recall probabilities of their 
model. Taking these arguments together, it 
seems safe to conclude that the assumption of 
zero forgetting for the average recognition 
performance on items in Positions 5-6 is 
untenable. Again the discrete-states assump- 
tion has to be rejected. 


DiscussION 


Our review of some pertinent experimental 
results has shown that the prediction derived 
from the discrete-state models of recognition 
memory is not supported by the data. The 
models proposed by Bernbach (1967) and 
Kintsch (1967) can account for perfect recog- 


nizability of new items that obtains in most 
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studies only by assuming zero forgetting- 
probabilities. This clearly contradicts the 
evidence given for the forgetting losses in these 
experiments. Admittedly, the reasoning by 
which nonzero forgetting-probabilities were 
inferred was indirect in some instances; how- 
ever, the fact that all the data presented? are 
in agreement supports our conclusion that the 
discrete-state models are inadequate. 

The experiments reported employed a variety 
of recognition tasks: single items presented in 
a Peterson and Peterson (1959) paradigm, 
items presented in lists, and paired-associates 
recognition learning. Also, diverse materials 
were used: 3-digit numbers, 4-consonant com- 
binations, trigram-letter and trigram-number 
pairs, single words, and word pairs. Thus, the 
rejection of the discrete-state models of Bern- 
bach (1967) and Kintsch (1967) is not confined 
to a limited class of recognition situations; 
rather, it seems that these models suffer from 
a basic shortcoming. This conclusion also 
holds for the recognition model proposed by 
Anderson and Bower (1972), to the extent that 
this model can be applied to the studies em- 
ploying word recognition. 

It would be wrong to conclude that the data 
presented lead to a rejection of the whole class 
of discrete-state models of recognition memory. 
The problem with the specific models of 
Bernbach and Kintsch is that they do not 
assume a separate state for new items; instead, 
the new-item state, Di, is also entered by old 
items unless memory is perfect. However, in 
order to account for perfect recognition of new 
items, these models have to keep new and old 
items in separate states, which can only be 
achieved by setting the forgetting probability 
tozero. Thus, it should be clear that discrete- 
state models containing a state that can only 
be occupied by new items but never by old ones 
have no difficulty in predicting perfect recog- 
nition of new items. 

Discrete-state models with this property can 
be derived from Bowers (1967) multicom- 
ponent theory, which assumes that items are 
encoded by vectors of components. In one 
version of the theory, forgetting is seen as loss 
of components. At the recognition test, the 
subject compares the retrieved component 
vector of the item tested with the encoded 
vector of the test item. Given that there is 
no problem of retrieving the correct vector 
from memory, mismatches between the com- 
ponents of both vectors can only obtain if the 


* Except those of Norman and Wickelgren (1969) 
mentioned earlier. 
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test item was new. Therefore, the occurrence 
of mismatches could be used by the subject to 
recognize new items correctly. Mismatches do 
not have to occur frequently at all for a subject 
- to reach a posteriori values of zero. If the 
subject is able to confine his high-confidence 
new ratings to those few trials where mis- 
matches obtain, he will achieve zero a posteriori 
values in spite of forgetting of components, and 
high similarity between the new and the old 
item distribution. Thus, it is apparent that 
perfect recognizability of new items in spite of 
nonzero forgetting is not at all crucial for 
models with a finite number of discrete states, 
Instead of giving up the discrete-state notion 
in general it might be worthwhile to consider 
recognition models based on Bower's (1967) 
multicomponent theory more seriously. How- 
ever, an entirely satisfactory application of the 
theory to the different kinds of recognition 
tasks requires a more thorough investigation 


of the retrieval Processes by which the corre- 
Sponding memory traces are found. 
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A Comparison and Elaboration of Two Models of Metacontrast 


Naomi Weisstein 
State University of New York at Buffalo 


Gregory Ozog and Ronald Szoc 
Loyola University, Chicago 


Metacontrast has been the subject of two neural network simulations by 
Weisstein and Bridgeman. We compare and elaborate on the two models, 
correct flaws not inherent in the models’ conceptualizations, and discuss the 
remaining shortcomings. The idea behind how U-shaped metacontrast 
functions are generated is similar in both models, but the assumptions about 
how the visual system is organized are quite different, In one of the 
models temporal ringing, combined with a complex and implausible linking 
hypothesis, is necessary in order for masking to be obtained; this model 
assumes a single spatial and temporal channel. In the other model, mask- 
ing does not depend on temporal ringing; this model assumes multiple 
spatial and temporal channels and a simple linking hypothesis. We show 
that of the two models only the second adequately predicts empirical meta- 
contrast functions, and we relate this model to recent evidence that the 
visual system contains multiple channels. Sometimes treated as a puzzling 
and somewhat singular phenomenon, isolated from the “mainstream” of 
visual data and theory, metacontrast may, on the contrary, turn out to be 
one of the more interesting manifestations of a multiple-channel visual 
system. 


Metacontrast backward masking has been 


tI caedis briefly flashed target (general a disc or 
he subject of two neural network simula- 


rectangle) and a briefly flashed mask (gen- 


ape in this journal (Bridgeman, 1971; 
^ €isstein, 1968). Metacontrast would seem 
oe enough as a backward-masking 
ot and might not generate such interest 
ere it not for the data it produces. If a 
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erally an annulus or two rectangles that 
flank the target) are above threshold and of 
roughly equal energy, the greatest reduc- 
tion in apparent brightness of the target will 
occur when the mask is presented as much 
as 125 msec after the target has been turned 
off (Alpern, 1953). But when the target 
and mask are presented simultaneously, little 
if any reduction in apparent brightness oc- 
curs. This reduction in apparent brightness 
with increasing delays between target and 
mask is illustrated in Figure 1. It is the U 
in this function that generates such interest 
in metacontrast. Why should a target dis- 
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appear only if a mask is delayed by some 
60-125 msec and not before? + 


Late Activity AS AN EXPLANATION 
FOR METACONTRAST 


Most theories attempting to explain the 
U-shaped functions in metacontrast have a 
common assumption, namely, that the kinds 
of masks used in metacontrast suppress or 
alter the late portion of the response to the 
target, and do not interfere with earlier por- 
tions. If so, in order for masking to occur, 
the mask must be delayed until its presenta- 
tion will interfere with the appropriate 
(late) portion of the target response. 

The response to the target may be divided 
into early and late portions if for no other 
reason than that a brief stimulus flash pro- 
duces neural activity which may last well 
over 150-200 msec after the Stimulus goes 
off (Sturr & Battersby, 1966; measurement 
of the gross potential)? In addition, the 
IW EE cus 


! One way around this 


puzzle is to assume that 
U-shaped functions in 


metacontrast do not really 
exist (eg, Eriksen, Becker, & Hoffman, 1970; 
Lefton, 1973a). However, Proponents of this 
Position will admit that there is a U-shaped re- 
duction in apparent brightness of the target 
(Eriksen et al., 1970; Lefton, 1973b) ; their main 
Criticism is that detection functions in metacon- 
trast are not U-shaped. This is generally true 
(but see Cox & Dember, 1972) ; however, it is an 
irrelevant argument, There are a number of 


similar paradigms which demonstrate that detec- 
tion and apparent bright 


ness do not covary. Corn. 
sweet and Teller (1965) found that increment 
thresholds remained constant even 


r though the 
apparent brightness of a steadily presented disc 
varied depending o; i 

ing annulus, Few 
increment threshold 
simultaneous brightne 
discs do look 
annuli than 


not change, 
not "reap". 


priori that 
Ontrast are not 
Serious attention (eg, Sekuler, 1973) 
» It would seem that we progress þy : 
rather than evading, that which is | 


Worthy of seri 


ploring, 


ex- 
zling. 


)uz- 
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early portion of this activity looks iau 
from the later portions, If early anil Ke 
correspond to different perceptual even 5 
then the late-activity explanation for Biete 
contrast may make sense. In dip ees 
something about the appearance of a ke n 
is affected, while its detectability is generally 
unchanged: Its apparent brightness or con- 
trast decreases, and its contours break up if 
fragment. The loss in detectability is held 
to a minimum by the metacontrast eae e 
Both target and mask are well above thres i 
old, and they are adjacent to each othet 
rather than superimposed on the same reti- 
nal location, If we suppose appearance to 
be connected with the late portion of neural 
activity, then a mask, to be effective in meta- 
contrast, might have to coincide with the 
late portions of neural activity to the target. 
Hence U-shaped masking functions 
be predicted. In somewhat loose 
then, two adjacent, briefly flashed, 
threshold stimuli mig 
each other's appear. 
produced by one 


would 
terms, 
above- 
ht not be able to affect 
ance unless the activity 
of the stimuli "hits" the 
right portion of the activity produced by the 
other stimulus; if this portion of activity is 


late activity, then U-shaped functions will be 
generated, 


NEURAL MODELS ANp VISUAL-SYSTEM 


ORGANIZATION 

To tie these ide 
dictions, one m 
response or yj 
model should 


as down to specific pre- 
ay make a model of neural 
sual-system activity. This 
Senerate metacontrast func- 
tions, be consistent with what is known 
about Mammalian visual-system organiza- 
tion, and make reasonable linking hypotheses 
—that is, make reasonable hypotheses about 
how neural Tesponse might correspond to 
the perceptual data (Brindley, 1970). 1- 

Before considering each metacontrast Le 
ulation in particular, a few comments poder 
order about two somewhat competing he 
of general visual- 
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Ficure 1. Metacontrast functions for each of three subjects, TJ, BW, and JP, as indicated 
on the figure (solid curves) at each of five target-to-mask energy ratios, T/M (functions 1 
through 5; from Weisstein, 1972). The mask was always a lighted 16 msec, 16 footlambert 
(54.8 cd/m?) annulus; the target was a lighted, 16 msec disc whose luminance decreased 
respectively from 1 through 5 as follows: 16, 8, 3.2, 2, and 1 footlambert (54.8, 27.4, 10.9, 
| : 6.8, 34 cd/m*). Each point on the solid curves represents a geometric mean of five mag- 
Ae nitude estimations of the apparent brightness of the disc at each of 27 Ats, beginning with 
Ar — 100 msec and proceeding in 10 msec increments to At = 100 msec; thereafter in 25 msec 
increments to At= 200 msec, and finally, 250 and 300 msec. U shapes are found for each sub- 
ject for each T/M except the lowest one or two. Dotted vertical lines indicate simultaneous 
presentation of target and mask, or M — 0. If the function falls below the amplitude at the 
dotted vertical line, then it can be said to be U-shaped. The height of the solid vertical lines 
represents the points predicated by Weisstein’s (1972) model. (From "Metacontrast" by Naomi 
Weisstein, in D. Jameson & L. Hurvich (Eds.), Handbook of Sensory Physiology (Vol. 7, 
part 4: Visual Psychophysics). Copyright 1972 by Springer-Verlag. Reprinted by permission. ) 


vey. respond to the assumption that the visual 
Single Channa system responded as if all single units had 
à . the same receptive field size and time course 

"e single-channel view assumes that visual- — of excitation and inhibition. Then one would 
Stem response can be characterized by one simply need one function to describe the 
m parameters. To put it in terms of visual system's spatial properties and one 
4y ons, a single spatial and temporal chan- function to describe the visual system's tem- 


| Single-channel view and the multiple-channel nel (or, equivalently, filter) would cor- 
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poral properties. The single-channel as- 
sumption is not necessarily contradicted by 
the evidence that single units in the visual 
system have different receptive field sizes 
and different rates and latencies of firing, 
since all units could contribute to a com- 
posite size and response rate, (See Nach- 
mias, 1968, for a discussion ofa composite 
visual-system spatial response, and see 
Cornsweet, 1970, for an excellent discussion 
of how if one assumes a single channel, one 
can obtain an overall estimate of the visual 
system’s temporal and spatial response by 
utilizing frequency-response techniques. ) 


Multiple Channel 


A multiple-channel view assumes that to 
characterize visual-system response, families 
of parameters are necessary, Many differ- 
ent channels may be active in the visual sys- 
tem, depending on whether a stimulus is 
small or large, periodic or aperiodic, transi- 
ent or sustained. These channels do not 
contribute to one composite, overall function ; 
rather, at some stages in the visual system 
their activities are independent of each 
other. Thus, for instance, at threshold it 
would not be the sum of the activity of a 


number of different channels that determines 
threshold, but the a 


channel that has reached tl 
not directly influenced by the subthreshold 
mumblings of other channels that may be 
active. (The threshold itself may be low. 
ered somewhat by subthreshold 
due to Probability summation 


c : ar channel as 
of the activity in another, 
m press, for an exe 

An above-thresh 
channel mod i 
channels 
tures of 
edges of 
others for 


to say about this later, 


fa multiple- 
different 


sume that units with di 
sizes respond to stimuli 


within the range of their field size, and that 
when a stimulus is too small or too be s 
the unit will simply not respond,  (Spatia 


d a; A 
multiple-channel models have usually been 


phrased in terms of parallel spatial filters, 
each responding to a hand of spatial fre- 
quencies, but we prefer, for the moment, to 
use à somewhat looser notion.) A temporal 
multiple-channel model would assume that 
units would respond at different rates, and 
this implies that some units might prefer 
transient stimulation and other units might 
prefer sustained stimulation. Again, we will 
have more to say about this later. The 
multiple-channel model doesn't mean that 
only one channel will respond if only one 
stimulus is presented: a particular stimulus 
may produce response in a number of chan- 
nels. The important point is that these 
channels are looking at different aspects of 


the stimulus, and they are responding at 
different rates, 


Tur Two METACONTRAST SIMULATIONS 


Of the two attempts to qu 
thesis that metacontr: 
of the late activity or 
System response to 
stein, 1968, 1972) 
model, and the other 
a single-channe 
the two models 
more plausible, 
as it stands, and į 
better fits the dat 
elaborate on botl 


antify the hypo- 
ast is the suppression 
slow portion of visual- 
a stimulus, one (Weis- 
is a multiple-channel 
(Bridgeman, 1971) is 
1 model, We will examine 
to see which one seems the 
Neither model is adequate 
n considering which model 
a, we need to expand and 
1? models, First, we will 
an's simulation, which as- 
channel, taking the steps 
it to generate metacontrast 
Was not done in his paper). 
^$ We examine this model, some serious 
shortcomings that have previously escaped 
notice Will become apparent. Then we will 
consider Weisstein’s simulation (1968, 
1972), which assumes multiple channels, 


expanding it so that it can deal with spatia 


Interactions. We will show that two man 
comings noted for this model as it was a 
ulated may be more apparent than a. 
Finally. we will DIESihE am elaboration " 
this model, which associates the fast * 
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FiGvnE 2. The activity of Bridgeman's (1971) neural network (the neural 
images) for each of six successive time periods. The top row of Figure 2 
shows neural images that occur during and aiter a disc is presented. The 
bottom row of Figure 2 shows neural images that occur during and after 
a disc is presented, followed by an annulus. The main thing to notice is 
that the neural images of cycles 3 through 6 for disc alone (top row) versus 
disc plus annulus (bottom row) do not resemble each other: This is the 
basis of the metacontrast interaction. Time is a discrete function of num- 
her of cycles: This is plotted in the inset. 


slow responses assumed by Weisstein with To obtain a picture of neural activity, 


the recent work on transient and sustained 
mechan: ; ; 
nechanisms (channels) in the visual system, 


Aletacontrast Simulation | 


Bridgeman (1971) hypothesized that meta- 
ee Occurs because the late neural 
s tivity to a target followed by a mask dif- 
rs from what it would be to à target alone. 


Bridgeman uses the one-dimensional neural 
network developed by Ratliff and his co- 
workers (1965), in which each neuron in- 
hibits and is inhibited by its neighbors at re- 
current intervals. A neuron in the system 
inhibits its neighbors in proportion to its own 
activity and to its distance from its neighbors. 
In addition, inhibition is subject to time de- 
lays: Adjacent neurons are inhibited with a 
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time lag of 30 msec and neighbors one re- 
moved are inhibited with a time lag of 60 
msec. To simulate this digitally, Bridgeman 
uses the Ratliff equations (1965, p. 110), 
with parameters chosen from Barlow's 
(1969) study of Limulus. This network is 
single channel because it has a single set of 
spatial and temporal parameters—a spatial 
weighting function consisting of a point 
excitation and a spread of inhibition on 
either side, and a temporal weighting func- 
tion that is simply a step for excitation 
and inhibition from adjacent units, and a step 


with a one-period lag for inhibition. from 
units one removed. 


The Network 


The Ratliff (1965) equations represent a 
discrete form of repeated inhibition: Each 
recurrence of inhibition “spreads” a signal 
laterally so that one by one, units further 
and further removed from the site of stimu- 
lation become involved. The activity of the 
neurons in the network, what we may call 
the neural image, is shown in Figure 2, 
during and after stimulation, for each suc- 
cessive recurrence of inhibition of the net- 
Work. The top row shows a network re- 
sponding to a target alone, and the bottom 
TOW shows a network responding to a tar- 
get and then a mask, which is presented 
aiter the initial Tesponse and one Cycle of 
inhibition. of the network, The inhibition 
delay defines time in this network ; 
one cycle means 


60 msec 
This quantiza- 
s shown in the 


* This figure wa 
our own simulati 


cause we 
each had 
amplitude; it is easi 

My easier 
"d Images look Jik in Cycl 

A " 7 e: 1 

ues signal is very small, (4 djusti iE x 
Deak-to-trough amplitude, or normalizing S ie 
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As shown in Figure 2, when the stimulus 
is on, the corresponding neural images re- 
main fairly similar to the physical stimulus 
through successive cycles, that is, through 
successive 30 msec intervals at which the 
inhibition recurs, Each successive recur- 
rence can be seen to spread the signal, and 
edges and troughs develop at the borders of 
the stimuli. (Edges and troughs generated 
by networks like these are the basis of many 
explanations of Mach bands and related 
effects.) Bridgeman adopts the term homo- 
photic from Stigler (1910) to describe the 
set of neural images during stimulation. The 
term metaphotic describes the set of neural 
images that occur When the stimulus is re- 
moved. Although the metaphotic images 
no longer resemble the physical stimulus, 
they are uniquely related to it. 


Masking 


The set of neural images that occurs when 
the target is presented by itself comprises 
the reference to which all other conditions 
are compared. The amount of masking is 
assumed to depend on the extent to which 
the set of neural images formed when both 


target and mask are presented differ 
the set of neural im 


target is presented ] 


s from 
ages formed when the 
Y itself. This measure 
functions, 
larity that 
cated, since 
functions, ) 


does not ch 
Bridgeman 


i n ame size, it should be stressed 

eed by the seventh Or eighth cycle, the actual 

bs d image is almost indistinguishable from 
se. 


There are Some minor 
D-TOW cycles 
low, 


differences between our 
(3 and 3), and Bridgeman's. Be- 
Some minor differences between 
(Figure 3), and Bridge- 
Bone through extensive checking 
our own simulation (following 
scription of how he did his as 

convinced that our calculations 
The differences may be due to He 
ors: We used different computers 
l have led to a difference in word Tot 
significant digits, rounding errors, aen 
tting routine; and there may have a 
drafting errors in Bridgeman's prs 
; none of these differences are make 
7. A'Be to affect the arguments that we 


Number of 
ter and plo 
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of similarity includes the entire neural image, 
from one end of the network to the other: 
The activity of ecery element in the net- 
Work is considered, rather than, for instance, 
Just the activity in and around the target 
area, We shall have more to say about 
this below, The similarity comparison is 
made cycle by cycle, and a similarity func- 
tion is produced, which gives a value of 
similarity for pairs of neural images (target, 
target + mask) for each cycle.* 

Bridgeman (1971) divides masking into 
two types: The type associated with the 
homophotic neural images (forward and 
Simultaneous masking) and the type as- 
sociated with the metaphotic neural images 
(backward masking and metacontrast). Ac- 
cording to Bridgeman, homophotic neural 
mages allow subjects to make detection 
decisions, while metaphotic neural images 
have to do with the appearance of the tar- 
Set. It is not clear how the visual system is 
able a distinguish homophotic from meta- 
photic images; the images themselves can- 
hot provide a unique criterion for whether 
Or not a stimulus is physically present. But 
or the purposes of argument, let us assume 
that this problem can be solved. If we as- 
Sume, then, that the system is somehow 
able to tell the difference between homopho- 
He and metaphotic, in those instances where 


am E similarity comparisons are called cross- 
Boas but the reference appears to be to a 
rr e That is, if one computes the actual 
mur tn for the output generated by 
illustrated T (1971) network for. the 6 cycles 
256 404 in Figure 2, they are, in order, 1600, 
Semblas ; 172, 16, and T5. This bears no re- 
tried xS to Bridgeman's Figure 6. Next we 
nut oo variations on our cross-correlations, 
W e a them got us near Bridgeman's Figure 
Mae perhaps he had normalized the 
toonane y ne in SOME way, so we computed a 
" 2)/F( e correlation (Lathi, 1965) : F(x) * 
br 3 AE F (x), where * denotes convolution 
Chosen eke stimuli, this is the same as 
teston E ation), F(x) =the neural image in 
neural imac the target alone, and G(x) = the 
pain th ge in response to the target-plus-mask. 
Flavus pi o bers in no way resembled 
Son ys [ihe igure 6. We then computed Pear- 
Points F(0) AS ADS cross-correlation for the 
is all sant 3 (0)], and finally our results agree 
bk gu Jor respects with Bridgeman's Figure 6 
agree. s minor respects in which they do not 
» See footnote 3). 


331 


the mask is presented before or simultane- 
ously with the target, the homophotic images 
of the target are somewhat different from 
those of the target and mask together. In 
this case, detection drops. When the mask 
is presented later than the target, the homo- 
photic images are identical, but the meta- 
photic images of the target alone are differ- 
ent from those of the target-plus-mask, as 
can be seen in Figure 2. Here, then, the 
appearance of the target—its apparent 
brightness—is assumed to change, and one 
has the makings of a metacontrast siutation. 
(Actually, the model predicts that whenever 
detection drops, the appearance of the target 
also changes for those instances where the 
target is detected, since if the homophotic 
neural images of target and target-plus-mask 
are dissimilar, so are the metaphotic images. 
But the important point is that one can 
have appearance changes without detection 
changes, and this forms the basis of the 
metacontrast predictions. ) 

Thus the appearance of a stimulus is medi- 
ated by the late neural images (i.e., the late 
neural activity), and metacontrast occurs 
when these late neural images differ from 
what they would be if a target had been 
presented alone. How is a metacontrast 
function constructed from this formulation? 


Constructing the Metacontrast Function 


A metacontrast function is a plot of some 
measure of the change in appearance of the 
target (generally apparent brightness) for 
each delay between target and mask (for 
each At). No plots of this kind appear in 
Bridgeman's (1971) simulation. Rather, 
neural image similarity is computed cycle by 
cycle for one A£ in the backward masking 
range, one in the forward masking range, 
and one at simultaneous masking. The 
similarity function obtained is different for 
these three cases; in particular, for the back- 
ward masking case the later, metaphotic 
neural images of the target do not resemble 
the corresponding images of the target-plus- 
mask, This similarity function is shown in 
Figure 3. 

A similarity function is not a metacon- 
trast function, because it provides informa- 
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Ficure 3. The similarity function (what Bridge- 
man, 1971, calls the Cross-correlations) for what 


is assumed to be the metacontrast or backward 
masking case 


, this represents only the 
point At — 60 msec), "The points on this function 
neural ima, 
, disc-plus-annulus ) whi 
same cycle, 


in the top and bottom 
TOW (each Cycle) in Figure 
another Pearson r, 


tion about only one del 
But the information 

function can b 
tain a metacon 
cide on an 


ay (or value of At), 
from the similarity 
€ suitably Processed to gp 
trast function, We can de- 
Appropriate measure for m 
(a measure that computes from the simi- 
larity function for a particular delay a num- 
ber that Corresponds to target ‘apparent 
brightness for that delay) and then we can 
repeat this calculation on similarity func- 
tions for a Sufficient number of delays to 
give us a metacontrast function, 
If we take these calculations ste 
We must first decide On what 
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p by Step, 
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to compute from the similarity a 
The computation will yield a number, w 7 
will be assumed to correspond to target E 
parent brightness. One should remem - 
that the model assumes that the rg Ager 
brightness of a target is mediated by ea 
metaphotic images, and that masking > : 
lated to how similar the neural images tha 
Occur when a target is flashed in SCHIBBE- 
tion with a mask are to the neural images 
that occur when a target is flashed alone, 
However, the set of metaphotic images can- 
not be a necessary condition for target na 
parent brightness, since a steadily pem 
disc of light has a distinct apparent brigh = 
ness, and therefore homophotic stimulation 
is sufficient for the perception of apparent 
brightness. (This is related to the problem 
of how the System is able to distinguish 
homophotic from metaphotic stimulation in 
any case.) Soa suitable measure of ap- 
parent brightness for a target for a particular 
At might consider both homophotic and 
metaphotic neural images, and simply sum 
the similarity values, that is, sum the points 
computed for the similarity functions for that 
At. The sum would run from the onset of 
stimulation (the first cycle) through the 
Seventh or eighth cycle (the just-distinguish- 
able signal). The Sreater the similarity, the 
greater the value of the sum, the brighter 
the target, and hence the less the masking. 

The results of these calculations are shown 
in Figure 4. Figure 4a shows calculations 
for three Points in the original network; in 

i » We calculated all the other +Ats 
that can be generated by the network, These 
are 30, 90, 120, 150, 180, and 210 msec. 
(The entire set of Als is limited to about 
eight, due to the quantization of time and 
the fact that, at Als longer than 210, the 
target signal can not be distinguished from 
* It should h 
Various other 
through seven 


€ noted that we also tried wt 
combinations of cycles: dis 
» just the metaphotic oo ese 
d so forth. None of houl 
es the argument. However, one 5 and 
measurement to the later grecs the 
measurement should stop at ahan even 
Or eighth cycle, Hs large 
though the Pearson ; al iS 
Negative or p 
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Fictre 4. Masking, as computed from Bridgeman's (1971) similarity 
functions. Each point represents the sum of Pearson rs at a particular At 
from the first through the seventh cycle. Figure 4a shows what one would 
obtain if attention were restricted simply to the three values of Af that 
Bridgeman considered; Figure 4b shows the masking functions one obtains 
when one computes the sum of points on each similarity function for all the 
other Ats that can be generated by Bridgeman's network. The dotted line 
shows the masking function obtained when both target and mask are pre- 
sented for one cycle of the network; the solid line shows the masking 
function obtained when both target and mask are presented for two cycles 
of the network (allowing the network one application of inhibition before 
stimulation is removed). These can be thought of, respectively, as a target 
and mask presented for durations of 0 <t< 30, and 30 <t € 60, where t= 


duration. 


noe: J In addition, we calculated the same 
oi Ats for another duration, namely, a 
uration less than or equal to 30 msec, or the 
condition where the network does not have 
a chance to initiate a second cycle before the 
Stimulus is turned off. 
pene arrows in Figure 4a suggest the 
ce of a U. But when one extends 
these calculations to include the complete 
Set of Ats possible in this network, it be- 
eg apparent that the function reverses 
dt two or three times (rather than just 
des produce the U) and thus does not 
empirical metacontrast functions which are 
Ptoblen y obtained. Let us examine this 
lem, and related temporal problems, in 
a little more detail. 


Shape of the Metacontrast Function 


etw í . 
" Networks such as the one in Figure 2, 
1 inhibitory feedback occurring atter 


some short, discrete delay, are subject to 
damped oscillations or temporal “ringing.” 
That is, a point on the neural image will, 
with sustained stimulation, first fire at a cer- 
tain rate; on the next cycle, it will fire less; 
on the succeeding cycle, it will fire somewhat 
more again; and so on, This ringing allows 
strong masking to develop. Pairs of neural 
images (target, target-plus-mask) at par- 
ticular cycles can become quite dissimilar, 
with corresponding points on the neural 
image even having differing signs (see 
Figures 2 and 4, cycle 6 for instance: The 
Pearson r for this cycle goes negative). But 
ringing also results in the metacontrast func- 
tion itself oscillating a number of times. If 
for a given Af, the neural images produced 
by a target-plus-mask resemble the neural 
images produced by a target alone, then for 
the next Af, the neural images of the target- 


plus-mask will be out of phase with those 
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of the target alone, and hence metacontrast 
masking at successive Afs will first be weak, 
then strong, then weak again, then strong 
again, and so on. But metacontrast func- 
tions do not oscillate (Alpern, 1953; 
Growney & Weisstein, 1972; Schiller & 
Smith, 1966; Weisstein, 1971, 1972; Weis- 
stein, Jurkens, & Onderisin, 1970). It is 
unlikely that if metacontrast functions had 
two or three major troughs and peaks they 
would have been missed by the sampling of 
Ats utilized by the various studies. For 
instance, Figure 1 shows empirical metacon- 
trast functions (solid lines) which contain 
27 Ats, 21 of them spaced in 10 msec incre- 
ments, from —100 to +100 At. In the de- 
sign from which these data are drawn, all the 
Ats are presented in random order five times 
per At per target luminance per subject. 
While there are some ragged edges in Sub- 
ject TJ’s functions, there is no evidence of 


the kind of abrupt reversals shown in 
Figure 4b, 


Other Temporal Properties of the Network 
What matters for a met 
is the ratio of target durat 
tion (with luminance held 
we will call T/M. 
target and mask ar 
T/M = 1, the same 
function is obtained, 
durations of target ar 
matter what T and M (Kahneman 1967 
Weisstein, 1972; Weise; & Gr SERE. 
1969), Thus a tar bm 
Set and mask c 
sented for 25 mse 
each, and the diffe i 
functions is negli 
On the other h 


acontrast function 
ion to mask dura- 
constant), which 
When the durations of 
e equal, that is, 

U-shaped metacor 


+ Metacon- 
ange shape (they become 
sing with increas 
Neither o 
I5 generated by 
k. 


cles in thi 
" ka network 
W hile there 1S nothing Wrong with ap ens 
mating time by discrete Steps, the x of 
5, assume: 


duration of each step is much too large to 
simulate the pertinent features of metacon- 
trast. This is especially true for the first 
30 msec, when T/M is the most sensitive to 
variations in duration, For instance, if one 
presents a target for 1 msec and a mask for 
16 msec, a monotonic function of +At will 
be obtained; if one presents a target for 16 
msec and a mask for 16 msec, a U-shaped 
function of --A/ will be obtained (Weisstein, 
1971). This network cannot distinguish be- 
tween these two cases. 

Target and mask of equal duration (T/M 
—1). At the jump points of the network 
—30 msec, 60 msec, 90 msec—the neural 
images change. This means that 
holds T/M constant at 1, 
30 msec are 


if one 
but multiples of 
added to both target and mask 
(that is, one changes the number of cycles 
the network goes through while each stimu- 
lus is present), different metacontrast func- 
tions will be obtained, Fi 
this: The solid line shows 
tion obtained by allowin 
receive the target (30 ms 
cycle once while the t 
for a total target 
The dotted lin 


arget is still present, 
duration of 30 X t « 60. 
j e shows the masking function 
obtained by allowing the network no recy- 
cling while the target is on: It c 
ceives the target (0c, < 30). 
case, the mask js Presented for the same 
duration intervals as the target. Since 
T/M = 1, the masking functions should be 
identical, However, as Figure 4h shows, 
the functions are quite disparate, This con- 
tradicts a considerable body of empirical 
evidence (Weisstein, 1972). 
hese tempora] problems are not caused 
by the fact that time is quantized in this 
network, but rather by the dependence of 
the recurrence Of inhibition on the stepping 
of time, Hence, the Problems are not solved 
by changing the cycle time, that is, by de- 
creasing or increasing the duration between 
Steps, the cycle time longer will 
asking at Afs which are too great 
to Correspond to 
Þirically, 
ficulties jn Predicting monotonic functions a5 
T/M50 Ma 


A : shorter 
aking the cycle time sho 


simply re- 
In each 


Making 
Produce m 


t 1- 
What has been found p^ 
and it will only increase the di 


Will only translate the metacontrast func- 
tions leftward and increase oscillation fre- 
quency. This can be illustrated in Figure 
4b: Shortening the cycle times, for example, 
would be equivalent to rescaling the -v-axis. 
If this were done, and each cycle time were 
assumed 5 msec long (so that the intervals 
were —10, —5, 0, +5, +10, and so on) it 
Would solve neither the problem of oscilla- 
tion nor the problem of differing metacon- 
trast functions for T/M — 1 for different 
target-mask durations. But shortening the 
Cycle time would introduce additional prob- 
lems: Masking would occur too soon, and, 
at the delay at which masking should occur, 
the network would have already gone 
through so many cycles that the signal 
would be too damped to be distinguished 
from noise. 


Spatial Properties 


The network exhibits spatial as well as 
temporal ringing. This means that it does 
hot predict the spatial properties of meta- 
Contrast either. Empirically, as the separa- 
tion between target and “mask increases, 
Metacontrast decreases monotonically. The 
network, on the other hand, produces abrupt 
Teversals in masking as the separation be- 
tween target and mask is increased, unit by 
unit. So, for instance, masking is greater 
at spatial separations of three neural units 
eEween target and mask than it is at sepa- 
Eon of two units. These abrupt reversals 
are again a function of the ringing of the 
Network, especially at the edges of the target 
and mask; when these neural images are in 
Phase, masking is slight; when they are 
Out of phase, masking is heavy. 


Tha Linking Hypothesis: Correlation as an 
"dex of Apparent Brightness 


ti Even if the temporal and spatial proper- 
iue of metacontrast were better predicted by 
tae sum of Pearson rs for Af =60 msec 
at a Beman's condition, 1971) for seven cycles 
and age POOR. of two units hetween target 
between. t Ti 3.75; at a separation of three units 
Spects tl arget and mask it is 294. Tf one in- 
solid is metacontrast function for this condition 

to the HSS m Figure 4b), it is clear that relative 
le rest of the points, this is a large difference. 
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this network, the way neural image simi- 
larity is measured cannot be justified. For 
each cycle, a Pearson r is computed for the 
neural images obtained in response to the 
target alone versus those obtained in re- 
sponse to the target-plus-mask. What this 
means is that the value of each element in 
the network for a target neural image at a 
particular cycle is multiplied by the same 
element in the network for a target-plus- 
mask neural image at that cycle, and then 
the products are summed across the entire 
network and normalized. Consequently, the 
measure will produce masking no matter 
what is added to the target, as long as it is 
presented somewhere else in the network 
(and, at least until the cycles spread the 
neural image to the location in the network 
where the additional stimulus is located, it 
doesn't matter where else in the network the 
additional stimulus is presented). That is, 
for this measure, the magnitude of masking 
is not affected by the separation between 
target and mask until the signal is spread 
far enough, Thus, within some finite num- 
ber of cycles one could simply add a second 
stimulus at the very edge of the network, or 
some place in the middle, and the same 
amount of masking would be produced. In- 
deed, one could add two, three, many stimuli 
at the edges of the network and produce 
lower and lower correlations. 

This does not agree with the evidence. 
Adding flanks in metacontrast only decreases 
the apparent brightness of a target when the 
flanks are close to it (Alpern, 1953; Weis- 
stein & Growney, 1969). As the spatial 
separation between target and mask in- 
creases, masking declines sharply. Further- 
more, when disc and annulus are presented 
simultaneously at the same luminance and 
duration, enhancement may occur (Mat- 
thews, 1974). 

It might be more reasonable to inspect the 
change in neural activity only for those units 
in and around the location of the target.* 


7 It could be argued that the information about 
the target might not be restricted to a retinally 
localized area. Such distributed encoding schemes 
are discussed by W ein (in press). Distribu- 
tion of target information over an extensive re- 
tinal area may be a possibility, but then, generally, 
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When this is done, the correlations remain 
near 1.00 for both the simultaneous condi- 
tion and for 60 msec Af. Cycle by cycle, 
for 60 msec Af, they are 1.0, 1.0, 1.0, .97, 
932, 997, 999. This hardly constitutes a 
reduction in similarity. 


Summary of Metacontrast Simulation 1 


This model fails to simulate a metacon- 
trast function, and it predicts neither the 
spatial nor the temporal properties of meta- 
contrast. In addition, masking is predicted 
only by the use of an implausible linking 
hypothesis, ‘ 

Many of these flaws have specifically to 
do with the way in which this particular 
simulation is constructed and the assump- 
tions behind it, But some of the difficulties 
might beset any single-channel network, 

hich essentially repeats the physical stimu- 
lus (with some modification), Such a net- 
Ork preserves information, but it can 
hardly be thought of as processing informa- 
tion. That is, different features of a stimu- 
lus, which might Provide some basis for 
deciding what kind of stimulus has been pre- 
sented and when that stimulus has been 
presented, are not independently represented, 
at least at the level where the network is 
still single channel, 

We can use Bridgeman's (197 
as an example of the 
create. In Bridge 
brightness is distin 
à stimulus- 
turned off, 
able for two 
channel alone, it 
guish which js the 
the metaphotic si 
tion that met 
parent brightness 
fact that if à stimulus is left on 
lor a subject to make 
subject wj] 
apparent 


problems this may 
apparent 
tection hy 
timulus is 
is unten- 
with a single 
ble to distin. 


a response 
l respond that it has 
brightness i 
parent shtness, Making 
things stimulus-dependent may be one of the 
few Ways one can distinguish between diffe 
o be f- 


a distinct 
He Wever, 


more than one cha 


use a more plausible linking hypothes 
Such. distributed encoding. to perceptual 
P ptu 


is to re]; 
al events, 
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ent stimulus properties while preserving the 
notion of a single channel. 

It is entirely possible that a more con- 
vincing single-channel model can be con- 
Structed that will overconie some of these 
problems. Meanwhile, however, it might be 
worthwhile to turn attention to the other 
metacontrast simulation (which assumes 
multiple channels and a rather straight- 
forward linking hypothesis) and see if this 
simulation. does any better, 


Metacontrast Simulation 2 


The other metacontrast simulation (Weis- 
stein, 1968, 1972) assumes that there is 
mutual inhibition between fast- and slow- 
responding neural populations, In particu- 
lar, for metacontrast it assumes that units 
Whose response is fast inhibit units whose 
response is slower, Hence, one must delay 


the stimulus for the fast-responding units in 
order to have them 


interfere with a prior 
stimulus. 


Quantification and Criticisms 


These assumptions were worked | out 
quantitatively in a neural network incorpor- 


ating Rashevsky-Landahl two-factor neurons 


—neurons whose temporal response is a 
combination of excitation and inhibition 
(Weisstein, 1968, 1972). "phis network 


Successfully predicted 
ot metacontrast. such 
the metacontrast f, 
Shape as T/M 


à number of properties 
as (a) the shape of 
unction and its change of 
Changes (as in Figure 1); 
b) the reappearance of the target when 
target and mask are repeatedly presented at 
eerste Als (as in Schiller & Smith, 
9) ; and (c) for 34 out of 35 data points 
that appeared in the metacontrast literature, 
the Af at which the minimum of the U oc- 
curred, This model is continuous, so any 
Combination or target and mask luminance, 
duration and Af can be simulated.’ It does 
* Inspection 9f the predict 


(1968, 1972) model 
glance, seem 


ions from Weisstein è 
in Figure 1 might, at firs 
to indicate that the same type O 
me applies to her model as weli 
t the case, Weisstein’s network p 
d on an analog computer, and within t^ 


mn of the Computer, any duration and 
© simulated, f 
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TARGET 


Figure 5. Expanded version of Weisstein's (1968, 1972) model, which 


clarifies the spatial symmetry implicit in that model. 
neurons are those which also appear in Wei 
than me; therefore, stimulation for ns must arrive later 


ne responds fi 


The subscripted 
stein. (1968, 1972). Neuron 


than stimulation for ms in order that the response of ms be suppressed. It 
one traces responses in the unlabeled neurons, it is clear that an identical 
effect will occur for the bottom neuron similar to nu if the stimulus in the 


top row is delayed. 
duce the same result. 


ae on temporal ringing to produce 
Sie je and thus jt is not subject to the 
iat emporal difficulties as the network 
of ni sa earlier. In addition, the measure 
io ike brightness Is proportional 
‘ited CHECK ot firing ol the unit desig- 
target a E information. about the 
Hilo so the linking hypothesis is not quite 
; problematic as it is in Bridgeman's (1971) 
Yetwork. 
aoe there are two criticisms of this 
taam es One has to do with what are 
Sexes | to be the spatial constraints im- 
by the model, and the other with a 
ner assumption behind some of the 
ameters chosen for individual neurons. 


Spatial Interaction 


sien Criticism here has been that “the 
ara bio will not work if target and mask 

erchanged" (Bridgeman, 1971, p. 
e 2 seems to us easily solved. Weis- 
action (io for the metacontrast inter- 
symmetry 72, p. 248) is not symmetric, but 
tained Ty ds implicit, and it is easily ob- 

without making any additional as- 


Thus one can interchange target and mask and pro- 


sumptions, Figure 3 depicts the following: 
Somewhere, some place in the central ner- 
vous system, two neurons receive the same 
excitatory input. The response of one of 
these neurons is faster than the response of 
the other, and there is cross-inhibition from 
these fast responders. That is, the neurons 
that are responding faster inhibit their suc- 
cessive neighbors, Because their response 
is fast, inhibition will be effective only if the 
stimulus for these fast responders is delayed. 
Working through the various combinations, 
it is immediately obvious that the network 
is symmetrical: All that is required for one 
stimulus to suppress response to another is 
that it follow the other. (Although the 
schematic shows only one line of units re- 
sponding to target and one line of units re- 
sponding to mask, it is assumed that there 
will be a collection of units responding to 
any stimulus.) 


The Assumption of Fast Inhibition 


Consider now the temporal assumption, 
Weisstein chose rate constants for the in- 
hibitory factors in most of the neural units 
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that were faster than the rate constants for 
the excitatory factors. Since the inhibitory 
and excitatory rate constants are supposed 
to correspond to inhibitory postsynaptic po- 
tentials (IPSPs) and excitatory postsynaptic 
potentials, (EPSPs), Tespectively, 
amounted to an assumption that IPSPs were 
faster than EPSPs, Although there is some 
evidence for fast IPSPs (Creutzfeldt & Ito, 
1968), in general the assumption of IPSPs 
being faster than EPSPs appears tenuous. 
However, there are other ways to pro- 
duce fast and slow responses. With two- 
factor neurons, the particular rate and shape 
of the neural temporal response depends 
Most critically on the relationship between 
the ratios of excitatory gain to excitatory 
Tate constant versus inhibitory gain to in- 
hibitory rate Constant, and only secondarily 
on the rate constants themselves. One can 
produce a fast response in a neuron—that is 
(as illustrated by no in Figure 5); & rë 
sponse in which firing frequency reaches 
some criterion amplitude rapidly after stim- 
ulus onset and decreases to some criterion 
amplitude rapidly with stimulus Offset, so 
that the latency of response is short, and the 
temporal resolution is high—by adjusting 
the gains in that neuron (4, B), the rate 
constants (a, b), or both. This can be seen 
by solving Equations 2 and 3 in Weisstein 
(1972), which would give the functions for 
e and j, that is, for the excitatory and in- 
hibitory factors that combine to give the 


neural tempor. The solutions for 
ing initial condi- 
amiliar exponent: 


)=9 Qs [CA/a) 

(= gal — [(B/b) (1 mo 
where A is the gain 
stant for the excitato 
gain and b is tl 
hibitory factor, and e — 271g 
assumes, for j 


this 


al response, 
a step function input, sett 
tions to zero, are the f. als, 
2: » 
Response = ¢(; 


and a is the r 
Ty factor, and B is the 


- Now, if one 
nstance, that a 2 b (that į 
m t at is, 
EPSPs that are considerably faster 


IPSPs), and 4 > B, such that Phu 
slightly larger than B/b, a fast, sharp tran 
ent excitatory response win be Produced, 
Which will then die down to some minimal 


N. WzrssrEIN, G. Ozoc, anp R. Szoc 


firing slightly above spontaneous level for 
as long as a stimulus is maintained. à 

On the other hand, if one assumes a > ”s 
and 4 > B, such that A/a > Bjb, a slower, 
more sustained response will develop. Since 
a number of afferent orders are assumed to 
be traveled before the metacontrast inter- 
action, one could enhance these differences 
by further convolution (the fast response 
would get somewhat slower; the slow d 
sponse would get a great deal slower). Thus, 
there are a number of different choices of 
parameters that will give fast and slow re- 
Sponses. But one does not even have to 
adjust parameters relative to each other to 
produce Weisstein's results, 
tain fast and slow responses simply by as- 
suming slower conduction times in the slower 
responding neurons, Either (or both) of 
these alternate Ways of producing a fast re- 
Sponse would only minimally affect the pre- 
dictions from the Weisstein network, since 
the narrowing of the functions and their 
Shift towards the origin, with T/M — 0, 
as well as the other temporal properties of 
metacontrast that Were predicted, are due 
not so much to the temporal shapes of the 
slow and fast neurons (responses that are 
influenced by the rate constants), but to the 
neurons’ nonlinearity (Weisstein, 1972, 
Equation 4, P- 246) and to the fact that the 
response to a stimulus, even if it were transi- 
ent, would nevertheless smear out somewhat 
5 ange when the energy of the mask is high. 

T he nonlinearity means that the functions 
will first he wide and then narrow as mask 
energy increases; the “smearing” of the 


Mask in time means that one will not have 
to delay present 


ation of the mask as much, 
as its energy increases, in order to have it 
Interfere with the target, and thus the meta- 
ns will shift their minima 
n, as in Figure 1, 
In short, alternate Ways of producing à 
fast Tésponse are easily available; the focus 
on fast IPSPi gr "fast inhibition” (Weis- 
» Obscures the main point that 
€ that has to be fast in Weis- 
is a fast excitatory respons 
inhibits 4 slow excitatory T€ 
Construct this, one can choose 


One could ob- 


, 


stein's model 
Which then 
Sponse, To 
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parameters in several ways, only one of 
which involves choosing faster ISPs. 

One could decide on one or another of 
the alternative ways of producing a fast re- 
sponse by drawing on the accumulating data 
about the properties of transient and sus- 
tained neural populations in the visual sy 
tem (see below), and the equations for the 
two-factor neurons could be revised in light 
of this evidence. However, for now, since 
alternate constructions will yield roughly 
the same results, we prefer simply to note 
the equivalence of the different ways of 
looking at what was called fast inhibition. 


Further Elaboration of Metacontrast Simu- 
lation 2 


Edges and blobs. The main assumption in 
this model is that more than one temporal 
Weighting function is needed to characterize 
Visual system response (ie, fast versus 
slow response). So the simulation involves 
Multiple temporal channels, two classes of 
which are assumed to interact. But there 
are spatial assumptions as well. Weisstein 
(1968, 1971, 1972, 1973) proposed that the 
Units that mediated detection of a stimulus 
had different time constants than the units 
that carried information about contour or 
edge. She assumed, as Werner had sug- 
gested (1935) that the basis of metacontrast 
pios DINAR of edges or contour 

ormation, and that apparent brightness 
decreased as some function of the suppres- 
Sion of edge dges have long been known 
io play a critical role in the perception of 
apparent brightness (e.g., Cornsweet, 1970; 
Land & McCann, 1971; O'Brien, 1958; 
Shapley & Tolhurst, 1973). 

If apparent brightness is largely dependent 
es then “edgeless” stimuli should be 
TI $e resistant to metacontrast masking. 

his seems to be the case: Growney (in 
press) has shown that one obtains negligible 
amounts of metacontrast masking when the 
€i edges of a target are blurred. Mask- 
ig Is not as negligible when the target edges 
vii sharp and the edges of the mask are 
lurred, Thus relatively "edgeless" stimuli 
inp are not completely edgeless, especially 

one analyzes edges in terms of the amount 


of high spatial-frequency components they 
contain) do not get masked very much.? 
The slow response in Weisstein's model 
(1968, 1972) was assumed to have some- 
thing to do with contour or edge. Evidence 
has since accumulated that high and low 
spatial irequencies may be processed by dif- 
ferent temporal channels (e.g.. Kulikowski 
& Tolhurst, 1973), and that these different 
temporal channels may respond with dif- 
ferential speed and resolution (e.g., see dis- 
cussion by Breitmeyer, in press; Breitmeyer, 
Love, & Wepman, 1974; and Breitmeyer, 
Note 1; also see Hood, 1973). A high 
spatial-frequency response may be thought 
oi (very roughly) as a response to edges, 
and a low spatial-irequency response may 
be thought of (just as roughly) as a response 
to non-edges—blobs, or large, blurred 
shapes, as in a defocused picture.” Thus, 
given a single stimulus, the response to its 
edges may be slower than the response to its 
“blob presence." Hence, both an edge re- 
sponse and a non-edge response may be tied 
to the accumulating evidence for the pres- 
ence in the visual system of channels that 
respond to transient, low spatial-frequency 
stimuli, and channels which respond to 
stained, high spatial-frequency stimuli, (This 
direct association of fast and slow responses 
in metacontrast with transient and sustained 
channels was first suggested by Breitmeyer 


9 Growney (in press) has presented. quite inter- 
esting evidence that the specific type of edge in 
metacontrast in both target and mask can change 
the amount of masking obtained. While some of 
this is easily explained by the edge-blob scheme 
discussed below, some of this may require further 
elaboration. 

19 Actually, both edges and blobs will have low 
frequency components, but edges will also contain 
more high frequency components. Edges and 
blobs can be specified much more precisely, of 
course—íor any given edge or any given blob, one 
can calculate its spatial frequency or Fourier 
spectrum. Similarly, the units sensitive to edges 
and blobs can be defined by the spatial frequencies 
to which they respond. But until we know con- 
siderably more about these units’ spatial frequency 
response—bandwidth, channel density, variation of 
temporal response with center frequency (see 
Graham, in press, for an elegant summary of our 
ignorance on these matters, and an excellent dis- 
cussion of how to overcome it)—we lose nothing 
in precision by talking about edges and blobs, and 
we may gain in comprehensibility. 


et al, 1974, and Breitmeyer, Note 1. A 
different approach to the relationship be- 
tween masking and activity in the sustained 
and transient channels has been Suggested 
by Matin, 1974. and Matin, Notes 2 and 3.) 

It should be noted that transient responses 
do not necessarily imply Speed, nor do sus. 
tained responses imply delay ; however, 
there is evidence that the higher spatia]. 
frequency channels Tequire a longer 
integration time than low Spatial-frequency 
channels (€.g., see Breitmeyer, in 


there is neurophysiological evidence that the 


populations Sustained, 

Spatial-frequency stimuli (the Y and X 
1o pectively—Enroth-Cugejt & Robson, 
1966. For evidence Concerning conduction 
velocity and latency of these populations, see 
Cleland, Dubin, & Levick, 1971, and Fukada, 


1971; hese populations 
may mutually inhibit each other, see Singer 


, 


press). 
high spatial-frequency 


gratings of lower Spatial 
reaction-time data 
times for edges versus non-edge 
contain more hig! 


Masking in G eneral 


The associ 
With the acci 
and transien 
Channels, Which w 
scheme, has enorm 
It may make it possible t 
phenomena that have not 
been treated; and ; 
relation between 
Metacontrast (see 


ation of slow a 
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1974; Kahneman, 1967, 1968; Weisstein & 
Growney, 1969; Matin, Notes 2 and 3, for 
discussions) SE 

As long as channels are near enough to 
each other to inhibit each other, the edge- 
blob scheme works for metacontrast, as 
stated: U-shaped backward masking occurs 
because the fast part of the response to the 
mask inhibits the slow part of the response 
to the target, This can be easily extended 
to nonmetacontrast designs, where U-shaped 
masking functions are also obtained (Pur- 
cell & Stewart, 1970; Weisstein, 197] ; Tur- 
vey, 1973), that is, designs where the target 
and mask partially or completely superim- 
Pose on the same retinal areas, Separate 
channels are assumed mutually inhibiting ; 
as they approach each other's retinal loca- 
tions (including superimposition), inhibition 


ain the same, 
atial-frequency 
in press), For 
ape would be pre- 
dart of the response to the 
fast part of the response 


depending on one’s specific sp: 
assumptions (e.g., Weisstein, 
forward masking, a U sh 
dicted if the slow į 
mask inhibited the 
to the target, 


Here, one might assume th 
target blob response would 


the target eq 


at since the 
be inhibited 
5€ response, as in 
» the forward analog of 
me acontrast—paracontrast where the mask 
Precedes the target—would not be as great 


as Metacontrast, e it is the edges of a 
Er 


"Tt shou d 


sinc 


be noted that although the scheme 


I 
generalizes beyond typical metacontrast backward 
masking, jt does not C 


1 nor is it meant to) account 
for Masking that is th ; 


» depends pri- 
D of the target and mask, pro- 


montonic functi so forth (see 
‘ahneman, 1968, 1972, for a re- 


» the edge-blob distinction 
bs to target detection, as 
C assumed from the earlier model (Weis- 
Stein, 1968, 1972) where the basic distinction 
ji en contour interactions and detection. 
» masking that involves simple detection 

T as a result of different mechanisms toan 
; in any case, at the p 
© considering, we assume t im 
3S and edges Play a role in target al 
ce, 


Two Movers or METACONTRAST 


Stimulus rather than its blob presence that 
Seem to play such an important role in 
determining apparent brightness and the 
other features of the appearance of the tar- 
get which get disrupted in metacontrast. 
On the other hand, if the appearance of the 
target is even minimally dependent on blobs 
as well, then, although one might not expect 
Much paracontrast until the mask became 
quite strong, when the mask did become 
DUM, forward U shapes would be expected. 
urn the mask produced a response 
— much greater than the target re- 
hon pae might predict the same kind of 
i5 back Mie bottom of the U that one finds 
mr masking. This seems to be 
^ a Yappens, In Figure 1, the forward U 
S either absent or appears as a minor dip 
compared to the backward U when T/M = 
T mur reaches a respectable size as 
a, ecreases substantially ; then, with 
T decreases in T/M, the bottom of the 
begins to shift towards the origin. So, 
€dge-blob scheme might be able to ac- 


Cou r 
E Y both for forward and backward Us, 
anc for their asymmetry. 


the 


Sup there has always been the nagging 
rage il ui footnote 1) that metacon- 
insist d vs not really exist. In a metacon- 
al pe while it is clear that target ap- 
that "bus ae decreases, it is also clear 
lox ol I: something going on in the loca- 
measure der target, and indeed, detection 
trast hun P ely produce U-shaped metacon- 
is geier ae on _ Also, apparent movement 
When the y noticed in metacontrast, even 
ligible on apparent brightness is neg- 
Room ; he edge-blob explanation could 
the ed ra both of these things: If only 
thing poi a target are suppressed, some- 
would na eed remains. That something 
ihe thg target blob. But since blobs 
charinel a stronger response in the transient 

S, those which prefer moving stimuli, 


here is 7 
ia 1$ reason to suppose that the continued 
üctiv 8 

Vity of these channels during metacon- 


Tas asl A : 
m St masking would lead to the impression 
movement, 


CONCLUSION 


ag Stimulus produces prolonged neural 
ity. The notion that different perceptual 
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events are associated with different portions 
of this activity is intriguing, and it lies at 
the basis of all of the theories of metacon- 
trast discussed in this paper. The major 
difference in the various theories has to do 
with assumptions about visual system organ- 
ization. Here, Bridgeman’s (1971) model 
appears to fail in important respects, whereas 
Weisstein’s (1968, 1972) model does not. 
While much of this may be due to the 
particulars of the way Bridgeman set up his 
model, some of the failure does appear inti- 
mately connected to the general notion that 
a single spatial and temporal channel de- 
scribes visual system activity. On the other 
hand, Weisstein's model (with subsequent 
elaboration) not only offers the prospect 
of simulating many of the relevant spatial 
and temporal aspects of metacontrast, but 
agrees with much current investigation in- 
dicating both the presence in the visual 
system of multiple spatial and temporal chan- 
nels (Campbell & Robson, 1968; Graham & 
Nachmias, 1971; Kulikowski & Tolhurst, 
1973; Pantle, 1971), and also (for spatial 
channels at least) their mutual inhibition 
( Tolhurst, 1972). It is possible that by con- 
ceptualizing metacontrast in terms of mul- 
tiple spatial and temporal channels, we may 
get a clearer idea of the variety of perceptual 
functions served by these different channels. 
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d, & Neff, 1957; Meyer & ' 
From an anatomical view, 
al centers that apparently 
ce of functional restitution 
nsive and extends from the 
level of the spinal cord and brain stem to 
diencephalic centers and neocortex, as ex- 
emplified by the research noted above, 

On the basis of these and similar data, 
the present discussion accepts the validity of 
recovery of function following brain damage 
and the influence of certain parameters, such 
as infaney and seriatim damage, and from 
this point will consider What process or pro- 
cesses may rite the phenomenon it- 
self, histological accreditation 
1 nor the delineation of 
Operational parameters which 
process will be considered. 
àve been addressed by Rosner 
and little can be added to these 


underw 
Neither the 
of the phenomeno; 
the various 
influence the 


st em 
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€ primary interest of fhe 
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s that may be disrupted by 
ific brain areas, A classic 
: in the tradition of K, S, Lashley: 
1s the recovery of a two-choice brightness 
discrimination following striate decortica- 
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one of two brightnesses in a two-choice dis- 
rte box. _ After the animal has 
, Stered the discrimination, it is subjected 
k tye of some portion of its brain— 
iA present instance the striate neocortex 
Eo behavioral effects of this neural 
‘mage are assessed by later testing in the 
discrimination box, | T 


eius typical GU Postoperatively, this 
animal tap ii indicates that the striate 
oes — unable to perform the 
ket E E x. SUSPEN but with some 
tive sob e to reestablish the preopera- 
ea matory behavior. : 

ener ee these results have been in- 
je rl sa often than not, as indicating 
"ith B of functional reorganization. 
cept pe the above example, this con- 
ilav i A that initially the brightness 
d a by neurological mecha- 
ls Slats n the striate neocortex, When 
neurological neocortex is destroyed, these 
ehavior 2 mechanisms are lost, and the 
Ost. The X supported is also necessarily 
havior M is able to recover the be- 
level, the ei ‘because ou an operational 
Neurological ^s n erate s response to the 
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other inm E^ "i ‘Soiripanenk parts so that 
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Postoperati 1e destroyed neural tissue, The 
ishment e ronne thus allows reestab- 
because thes ps Svightness discrimination, 
centers Si Hüüelinwall reorganized neural 
Properties of ee over the operational 
general sence the lesioned brain area. In. a 
Somewhat e ids sort of explanation, while 
in its inr a) has a nontrivial elegance 
of rre “pe and gains a certain amount 
rom the plasticity the nervous 
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in the cs o denote a functional change 
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er or area. This is not, of course, 


345 


to deny experiental modification or learning, 
since this sort of plasticity is well within the 
design characteristics of neural tissue and 
moreover is not a property unique to the 
nervous system, Rather, the problem rests 
on the plasticity necessary to underwrite 
junctional reorganization, which is some- 
thing much more profound and quite dif- 
ferent, requiring the modification of the de- 
sign characteristics of the system itseli—a 
property which, if it were to exist, would be 
most unique. Second, and more important, 
this design plasticity, which is a requisite to 
functional reorganization, is incredibly dif- 
ficult to support with any sort of direct em- 
pirical evidence. That is, while the neuronal 
plasticity suggested to account for learning 
is often precipitated by the direct observa- 
tion of some particular physiological re- 
sponse of neural tissue, the concept of func- 
tional reorganization does not enjoy any 
such complementary justification. In point 
of fact, the only evidence for functional re- 
organization is recovery of function, and 
this is obviously quite circular. 

In contradistinction to the notion of func- 
tional reorganization, the present paper 
argues that the nervous system is not as 
unique as this concept suggests. Rather, it 
is suggested that the mature nervous sys- 
tem, like other mature biological systems, 
is quite stable after some critical point in its 
embryological and/or neonatal development. 
Accepting this, the present paper then pro- 
poses that recovery of function following 
brain damage depends on the sparing of the 
neural mechanisms responsible for the be- 
havior patterns prior to the neural damage. 
In other words, recovery of function follow- 
ing brain damage does not reflect the func- 
tional reorganization of neural tissue but 
rather the survival of the neural tissue that 
mediated the behavior patterns prior to the 
neurological lesion. 

The remainder of this paper attempts to 
support these notions of functional stability 
and neurological sparing as determinants 
of recovery of function following brain 
damage. To do this, two things must be 
accomplished. First, it must be shown that 
the mature nervous system is functionally 
stable, Second, a case must be made that 
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behavioral recovery represents no more or 
no less than the continued normal operation 
of the surviving brain centers. In this lat- 
ter instance, it is specifically suggested that 
the recovery of behaviors disrupted by 
neural lesions is the result of the fact that 
the neural mechanisms supporting these be- 
haviors were never really destroyed in the 
first place. 


STABILITY WITHIN THE MATURE 
NERVOUS System 


The evidence suggesting that the mature 
nervous system is functionally quite stable 
stems principally from three sources: (a) 
neural regeneration, (b) early experience, 
and (c) compensatory neural sprouting. As 
will be apparent, all of this research describes 
very dynamic aspects of the nervous system 
and yet, at the same time, provides quite con- 
vincing evidence that these dynamic aspects 
do not result in the functional reorganiza- 
tion of established neural centers. And, as 
should be apparent, this failure to support 
functional reorganization is in precisely 
those areas of research that should demon- 
strate it if it were a viable process. 


Neural. Regeneration 
While 


regeneration following neural 
damage is not ordinarily considered a prop- 
erty of the mature mammali 


an central ner- 
vous system, it is readily observed in the 
mammalian peripheral nervous System and 
in the central nervous System of lower verte- 
brates, Of critical import for the present 
discussion is not that there is regeneration 
but rather that regeneration, when i 
1s quite specific. "This was r 
( Dustin, 1910; Held, 1905 ; 
cited in Grossman, 

considerable controv 
mechanisms involved, 
relying on 


t Occurs, 
ecognized early 
05; His, 1889 ai 
1967) and generated 
ersy concerning the 
Mechanical theories 
; stereotropic factors (Haiden- 
hain, 1911—cited in Grossman, 1967) were 
advanced but proved somewhat inadequate 
in the lace of data indicating that the bii 
growing amoeboid nerve endings did n t 
ion follow ae path of least Tesiste 
Frossman, 1898—ci i 
cited in Grossman, 


iate : ri : 
1967). Neurotropic theories involving 
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chemical specificity, as initially proposed by 
Ramón y Cajal (1928/1959) and later A 
ported and refined by the classic. research 0 
Weiss (1936) and Sperry and pr 
tors (Sperry, 1947, 1950, 1955, er ; 
Sperry & Hibbard, 1968), to name but the 
more familiar, now appear considerably 
more likely. However, whatever the mech- 
anisms, and the options have been care- 
fully considered by other authors, the point 
remains that whether it is mechanical, 
chemical, or some thing such as neurobio- 
taxis (Kappers, 1917), neural regeneration 
is not a haphazard affair. On the contrary, 
it is quite specific and when it occurs, as 
when eyes are rotated, an incredible degree 
of functional stability is maintained, even 
when this stability is contrary to adaptive 
behavior (also see recent research of Mark 
and his associates [Cass, Sutton, & Mark, 
1973; Mark & Marotte, 1972; Mark, Mar- 
otte & Mart, 1972; Marotte & Mark, 1970a, 
1970b]). If the mature nervous system is 
dynamic or plastic enough to allow func- 
tional reorganization following br 
it is difficult to understand thi 
regenerative specificity, 


ain damage, 
s degree of 
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In contrast to the s 


during regeneration in t 
system, 


pecificity observed 
he mature nervous 
the nervous system of the neonate 
apppears to be capable of considerably plas- 
ticity. The extent of this plasticity is such 
that rather large changes in the functional 
organization of the nervous system as well 
as more concrete anatomical changes, for 
example, the thickness of the cortex, dendri- 
tic branching, and the like, can occur i? 
response to a variety of experimental ma- 
nipulations, This is, of course, just the sort 
of neural responsiveness postulated by the 
concept of functional reorganization to 4€ 


" ang 
count for behavioral followme 
brain damage, to EE 


notion of functio a 
individual passe 


recovery 
However, contrary 
nal reorganization, once 


di 3 nc 

S some critical period iost 
approaches maturity, this plasticity 15 le 
and the neryo tabe 


US System becomes very $ 


Over, the data indicate that this sta 
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experience 


ty 
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H ! a e" yore 
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and procedures specifically 
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Signed to induce neural reorganization and 
restitution. of. function. 

Early research in this area comes irom 
ES "ri ain studies of Wiesel and Hubel 
E pcc om & M iesel, 1970; 
Hubel m iesel, 1965, 1970; Wiesel & 
hy oe 3). The general procedure used 

ss e investigators was to restrict visual 
oe for a short period of time during 
first few months of life by suturmg 
TAN lids of one eye. The effects of 
i Bie deprivation were then assessed 
Sin aan single-unit responses in the 
eloped —— when the animal had de- 
ie is maturity. In the normal cat 
Privato) es cat reared without visual de- 
gie e approximately 80% of the 
gd vor were binocularly responsive. 
sae lk a cat, however (that is, the 
eyes E b a short time with one ot its 
CR e we shut), this binocular respon- 
ess. Wie to approximately 7% or 
nefvoùs bos regard to the stability of the 
Tte ur "uw: it is important to note that 
eso serian ?th to 10th postnatal week, 
ittle modifa are quite permanent and show 
Provided Vire even when the cat 1s 
cular vis a a up to 4 years of normal bino- 
n experience. It would seem, if 
tional -— system were capable ol func- 
normal aT that this amount ot 
siderale pen experience should allow con- 
appear E ecovery. That it does not would 
Mature, ar Fe neural centers, once 
ity is los unetionally stable. This stabil- 
similar mie attested to by the fact that 
even of 1 c deprivations in adulthood, 
able mius 8 duration, are without notice- 
ie eges on cortical electrophysiology if 
Sirti Stag age binocular visual 

Other e during its development. 
similar t en have reported results 

im i those of Hubel and Wiesel, results 
anv oed because they did not 
Vital zh a D depriving the individual of 
1970. Pinon. REA ir & Lope 
& E esie & Spinelli, 1971; Pettigrew 
are one 1973). For example, Blake- 
lino th ooper reared cats m darkness 
E ^ li e early developmental period with 
either : pron of daily 5-hour exposures to 

orizontal or vertical stripes. When 
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the kittens were 5 months old, they were 
allowed normal visual experience and under- 
went both behavioral and electrophysiologi- 
cal testing. After some minimal adaptation 
period, the subjects appeared to be visually 
quite competent with the exception of being 
virtually blind to stimuli oriented 90° to 
their early stimulation. Single-unit record- 
ings from the striate neocortex of these 
animals detected the normal amount of 
binocularly responsive cells, except that units 
responding to lines oriented at right angles 
to the kittens’ early visual experience were 
conspicuously absent. And it is critically 
important to note that this recording was 
accomplished after the subjects had under- 
gone some amount of behavioral testin 
which should have provided time and specific 
stimulation for visual reorganization if it 
were to occur, As Blakemore and Cooper 
(1970) stated, “Evidently the visual ex- 
perience of these animals in early life has 
modified their brains, and there are pro- 
found perceptual consequences" (p. 478). 
To this, the present argument would only 
note that these changes appear permanent. 
Similar results have been obtained when 
the visual experience has been restricted to 
small point sources of light (Pettigrew & 
Freeman, 1973), as well as when different 
types of restrictions have been placed on 
each individual eye (Hirsch & Spinelli, 
1971). Recently, Freeman and Thibos 
(1973) have extended these findings to hu- 
mans by correlating visually evoked re- 
sponses to meridional resolution in astig- 
matic individuals. While the mechanisms 
underwriting this developmental plasticity 
and adult stability are presently somewhat 
speculative and extend from postulations of 
cortical units dropping out due to lack of 
stimulation to the tuning of units to cor- 
respond to the environment in which the 
individual is reared, the fact remains that 
whatever occurs, when it occurs, it is per- 
manent. As implied by all of the previously 
discussed research and specifically tested 
by Dews and Wiesel (1970) and Ganz and 
Haffner (1974). normal visual experience 
does not rectify the conditions established 
during the sensitive developmental period. 
Even suturing the eyelid of the originally 
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undeprived eye, thus compelling the animal 
to use the previously deprived eye, has only 
minimal effects on overt visual behavior 
and is virtually without influence on cortical 
electrophysiology. What one is then forced 
to accept on the basis of these data is that 
subsequent to maturation, the nervous sys- 
tem is functionally stable. Given this sort 
of neural stability, it would seem somewhat 
inappropriate to suggest that behavioral 
recovery following brain damage is the re- 
sult of neural centers changing or reorganiz- 
ing their functional mode of operation. 


Neural Sprouting 


Further evidence that the mature mam- 
malian nervous system is functionally stable, 
but not static, is provided by the research 
concerned with neuromorphological plastic- 
ity, Similarly to the early-experience re- 
search, the data present a picture of the 
nervous system as a biological System cap- 
able of growth and change, but only within 
a relatively restricted frame of reference— 
a frame which, in the mature individual, 
does not encompass functional reorganiza- 
tion, It must be emphasized, however, that 
what is at issue is not whether the peripheral 
nervous tissue may reinnery 
motor mechanisms and thus precipitate be- 
havioral recovery, nor is it denied that re- 
innervating may even occur centrally to 
provide some measure of behavioral re. 
covery. In both of these Cases, the effect 
is not one of reorganization but rather one 
of reinstitution, and this is well within the 
assumption of neural Stability espoused by 
the present report. 

From the functional 
data concerned with 


ate sensory or 


point of view, 


the 
neural 


r Sprouting in- 
dicate, as in the early-experience data, a 
highly plastic nervous system 


during early 
neonatal development but a nervous system 


that rapidly | stabilizes during postnatal 
development, I will first consider the re. 


search concerned with neural Sprouting in 
the mature animal, 


Neural Sprouting: 


The Mature Ner 
System 


vous 
With regard to the mature n 


f I €rvous syg- 
tem, the major question 


1S whether the 
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manipulations that induce regenerative Or 
collateral neural sprouting, as the terms are 
defined by Stenevi, Bjórklund, and Moore 
(1972), will produce generalized or d 
stricted neural sprouting. If the pao xi 
neural sprouting following neurologica 
insult is generalized (i.e., it includes neural 
centers or fiber processes normally not part 
of the system affected by the lesion) then a 
primary condition for functional reorganiza- 
tion is met and the notion may gain con- 
siderable support. Tf, however, the neural 
sprouting is restricted to the areas directly 
related to the neural insult, functional re- 
organization is considerably less likely. In 
this latter case, as previously suggested, the 
effect of the neural sprouting would be better 
characterized as functional reinstitution, not 
functional reorganization, and as such would 
be compatible with the notion of neural 
stability, 

By and large, the data suggest that com- 
pensatory neural sprouting is quite re- 
stricted. The recent research of Good- 
man, Bogdasarian, and Horel (1973) may 
be taken as a specific example. This re- 
search was generated às an extension of 
earlier experiments (Goodman & Horel, 
1966) indicating that compensatory neural 
Sprouting in response to ablation of the 
striate neocortex occurred in only 2 of the 
possible 13 areas suffering deafferentation 
Subsequent to the neocortical insult, At the 
risk of oversimplifying an eloquent con- 
ceptualization, the methodology of Goodman 
et al. (1973) be summarized as one 
that tested for generalized compensatory 
Sprouting by optimizing the conditions neces- 
Sary for its occurrence, This was accom- 
lished by Partially deafferenting the central 
Visual areas of 3-month-old rats by uni- 
lateral enucleation, Some 16 months later, 
the remaining €ye was removed and after 
> days the tissue prepared for analysis 9 
degenerating fiber processes, using the Lax 3 
low modification of the — Nauta-Gyg2^ 
method. The arguments were that v 
early partial deafferentation should produc 


H S Ü 
* situation where competition from er 
other than the Visual system would be mi! 


may 


E va 
mized because of the remaining retinal 
Š PA j 8 

fibers, This Should, in turn, allow & 
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eralized compensatory sprouting in all de- 
afferented sites. The sprouting that was 
observed, however, was only at the sites of 
normal termination of the retinofugal fiber 
Processes remaining after the initial enu- 
cleation, Increased distribution, which would 
seem to be a minimal but critical require- 
ment for functional reorganization, was not 
observed at any of the visual centers, with 
= possible exception of the superior col- 
ais ky if the physiological con- 
EM : s functional reorganization oc- 
me ae appeared quite restricted. To 
1 hat this limited increase in distri- 
ime d be responsible for behavioral 
Bow m the aegis of functional re- 
Eu on is, in most generous terms, 
I g an empirical finding. 
od peni however, it must be noted that 
inii y research concerned with com- 
aena S menn. has indicated some 
mk E. generalized innervation (Me- 
Cajal ans zd Liu, & Liu, 1958; Rámon y 
ea Mer 59; Rose, Malis, Kruger, & 
sand ha ), although more recent re- 
iur is x Suggested that the neural sprout- 
i3 ry restricted (Cunningham, 1972; 
m man et al., 1973; Goodman & Horel, 
md i eios Pcie Parks, & 
1971; iom ; -— prem & Stenevi, 
Ce x m 202; Schneider, 1970; 
} al., 1972; Wall & Egger, 1971). 
Sean: and most important, anomalous 
neural s ns resulting from the compensatory 
in mele irit have never been reported 
LUN an C using mature subjects (Cun- 
Ends: 2; Cunningham & Lund, 1971 ; 
1959. ja & Horel, 1966 ; Liu & Chambers, 
1058 ^; nch et al., 1972; McCouch et al., 
Wall g Rm et al., 1971; Raisman, 1969; 
iiu gger, 1971). These latter results 
‘ind aps worth emphasizing, because it 
ice that anomalous compensatory 
bine. would he very strong neural 
fein evidence for functional reorgan- 
ota i. should it occur in the mature ner- 
p yam. That it does not would seem 
Nee Ge that behavioral recovery must 
a on Processes not involving the vicari- 
a tunctioning of neural centers not norm- 
Y involved in the behavior in question. 
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Neural Sprouting: The Neonate 


In distinction to manipulations performed 
on the mature nervous system, compensatory 
sprouting following neonatal lesions pro- 
duced quite a different pattern of results. 
Here, as with the effects of early experience, 
the picture is one of general neural plasticity. 

For example, the normal rat's visual sys- 
tem is almost totally crossed with only a 
limited number of ipsilateral retinofugal fiber 
processes, and these have a very restricted 
central distribution (Hayhow, Sefton, & 
Webb, 1962). If, however, unilateral enu- 
cleation is performed during the first 10 
days of postnatal life, there is a significant 
increase in the number of unilateral retino- 
fugal projections (Lund, Cunningham, & 
Lund, 1973). Moreover, these ipsilateral 
fiber processes originate from retinal areas 
normally producing only contralateral pro- 
jections and terminate in geniculate and 
collicular areas normally receiving only pro- 
cesses from the contralateral eye. Thus, it 
would appear that, similar to the ear 
perience research, there is considerably 
greater flexibility in the developing nervous 
stem than there is in the mature nervous 
system. 

Perhaps a more dramatic demonstration 
of this sort of developmental plasticity is 
provided by Schneider (Schneider, 1970, 
1973), who was concerned with neonatal 
brain stem lesions. This research not only 
confirmed the existence of anomalous syn- 
aptic connections but also provided convinc- 
ing evidence that these synaptic connections 
were operational. At first glance, the exist- 
ence of operational anomalous synaptic con- 
nections would seem favorable to the concept 
that hehavioral recovery is mediated by func- 
tional reorganization, but, as will be more 
fully discussed below, an analysis of the re- 
sulting overt behavior indicates that just 
the opposite is true. The general experi- 
mental procedure used by Schneider was to 
ablate, at birth, the superficial layers of one 
superior colliculus in a number of golden 
hamsters, The animals were then allowed 
to grow to maturity with normal visual 
stimulation and then were given a series of 


behavioral tests to determine the effects of 
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the early superior colliculus lesions. prm 
ing this, the animals underwent d a 
procedures to determine the vio. o ies 
pensatory neural sprouting, which i 
unilateral enucleation followed, E days ater, 
by histological analysis using various modi- 
fications of the Nauta method to assess 
degenerating fiber processes, u 
This analysis indicated rather Striking 
abnormalities in the distribution of retino- 
fugal fiber processes and terminations, Prin- 
cipally, there was evidence for an abnormally 
high number of fiber terminations in the 
ventral lateral geniculate nucleus, Addi- 
tionally, and somewhat more interestingly, 
there was evidence of fiber termination in 
the posterior lateral nucleus of the thalamus 
as well as the lower, undamaged layers of 
the lesioned superior colliculus, both 
which do not normally receive such fibers. 
Moreover, not only did the retinofugal fiber 
processes reach the lower, undamaged layers 
of the lesioned superior colliculus, but some 
also decussated and invaded the opposite, 
unlesioned superior colliculus, where the 
competed for terminal space with the fiber 
processes normally afferent to that area. 
Thus, the impression tl 
nervous system is highly plastic and matures 
in accord with the available Sensory input 
and, in the present instance, available neural 
mechanisms is again reinforced. 

But this plasticity of the de 
vous system, as in some 
data, falls short of being 
tive, 


of 


hat the developing 


veloping ner- 
of the regeneration 
functionally adap- 
For example, behavioral testing of 
those subjects subsequently shown 
the abnormal decussating fibers to the un- 
lesioned superior colliculus 
strange visual orienting beh 
animals, visual stimuli prese, 
abnormally Projecting to 

superior colliculus often led 
Sponses away from the y 
is, the animal would res 
Opposite eye we 
lation, 


unlesioneq 
to orienting re- 
isual stimulus, That 
pond as though the 
re receiving the visual stimu- 
at the 
anomalous synapti S were Opera. 
tonal, it also, at the same time, clearly sug- 
gests that the anatomical plasticity of the 
retinofugal fibers and the neura] reorgar 
tion observed by ë 


Schneider (1970, 


niza- 


1973) 
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did not yield correlative behavioral ke 
tion. I would suggest that the reason tha 
this behavioral restoration did not occur ie 
because the plasticity of the epi 
pathways was not associated with a eon 
plasticity within the mesencephalic term T 
tion area, Rather, at birth the superior co i 
liculus and perhaps other brainstem e 
centers are probably close to maturity, Sei 
like other areas reaching maturity after 
birth, they are functionally stable. The fact 
that the anomalous projection did not cause 
any sort of functional reorganization of the 
mature surviving superior colliculus is criti- 
cal and clearly indicates a lack of functional 
reorganization, even when the conditions 
seem optimal for its occurrence (i.e., opera- 
tional anomalous input). More simply put, 
the overt behavior patterns exhibited by 
lesioned animals appear to be a function of 
the neural areas spared by the neurological 
insult, and once mature, these neural areas 
do not reorganize in any functional sense 
but simply continue to operate in accord 
with the principles established at maturation, 
In summary, what the research concerned 
with neural sprouting, along with the re- 
generation data and early-experience data, 
Suggests is that the nervous system, like 
other biological Systems, is plastic and malle- 
able during its developmental Stages, How- 
ever, once development has run its course, 
the nervous System, again like other biologi- 
cal Systems, becomes quite Stable, And in 
functiona] terms, this Stability is maintained 
in the face of rather severe physiological 
Manipulations, What the remainder of the 
Present report attempts to show is that this 
degree of stability is not incompatible with 
the observation of behavioral recovery fol- 
owing neurological insult, Tn fact, much of 
the data concerned with such behavioral re- 
covery is not only compatible with the notion 
nters are functionally stable, 


that neural ce 
s è ) 
but provides striking evidence t 


Support. jt. 


rather 


Nee, 1 1 t, 

Accepting, if only for the momen na 
Stable nervous System incapable of functio": 
Teorganiz 


a 1 
ns Mes ; ex plat! 
ation, how is it possible to exp“ 


—_—, 
M. a 


Ay) 
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behavioral recovery following neurological 
insult? The possibility I would like to 
Suggest is based on sparing. To reiterate, 
the thrust of the present paper is that the 
nervous system is not unique among the 
various biological systems supporting life, 
and, like other biological systems, opera- 
tional recovery following damage is depend- 
E. dit ui spin: and what continues to 
1 in its normal manner. From this 
en bonia. behavioral recovery following 
edm A simply the result of the 
beladas s nanisms supporting the recovered 
bite never being destroyed. 
dier roy of this position, one might con- 
iux le f ] the sensorimotor recovery that 
y »served following lesions that sever 
mI neural fibers. Initially, the ability 
last “4 P to move the affected limb is 
the dec Dy common tests and definitions, 
S nbn is, in fact, neurological. However, 
pte aa poup of time, calculated at about 
the hoi ¢ ay : om the site of the lesion to 
sticker Pres the affected area, sensory and 
es seer tions may often recover. Clearly, 
olei ‘ii öne: gb functional recovery in- 
the uo cians But just as clearly, 
me os : eral recovery does not involve any 
eid functional reorganization, where one 
twee a changes its mode of operation 
ea. a to the destruction of some other 
e i. er. T he process simply one of re- 
i 1 and reinnervation, and little else. 
en Re fin a pointed out, con- 
re rm Naps remains to be done to detail 
sten i mechanisms which govern the 
ss PA cay However, the fact nonethe- 
he im that the sensorimotor behavior 
Erud ái ered because, in the real sense, the 
the ne MAS underlying it survived 
t irological insult. . 
valen turns to behavioral recovery in- 
tion E central nervous system, the ques- 
osition ved whether or not a similar prop- 
OF tke a be valid. From the arguments 
ít folo à SO report, the answer 15 yes, and 
Je ee irem a series of studies. suggesting 
nene eem behavior patterns often survive 
» inii Seit even though they may not 
hio: y observed in the subject's overt 
"lor immediately after the brain lesion. 
Nd it is of some importance to note that the 


range of behaviors over which this appears 
to be true is as impressive as the range of 
behaviors which may demonstrate recovery 
subsequent to neurological damage. 

One might consider first the reflexive be- 
havior of visual placing. The usual pro- 
cedure to test visual placing is to hold an 
animal, for example a cat, either above or 
below a table and slightly away from the 
edge. The animal is moved slowly up or 
down toward the edge of the table, with the 
result that when the edge comes into the 
animal's field of vision, the cat promptly and 
accurately places its forepaws on the table. 
Early data from Bard and Brooks (1934) 
indicated that after removal of the animal’s 
visual neocortex, visual placing was lost 
and did not appear to recover even with 
extensive postoperative testing. These re- 
sults were later replicated by Meyer, Horel, 
and Meyer (1963) and were generally at- 
tributed to the animal's inability to detect 
edges (i.e. a dysfunction of pattern vision). 
However, Mever et al. (1963) and more 
recently Meyer, Meyer, and Ritchie (cited in 
Meyer, 1974) noted that if the cat was tested 
under the influence of the drug dl-ampheta- 
mine, visual placing was reinstated and re- 
mained as a viable behavior pattern so long 
as the drug state was maintained. Obviously, 
these data provide rather salient evidence 
that the placing behavior disrupted by visual 
neodecortication was not destroyed, even 
though it was not easily detectable in the 
subject's postoperative behavior. Moreover, 
and more important, it is difficult to attribute 
the return of visual placing under the in- 
fluence of dl-amphetamine to any sort of 
neural reorganization. In the first place, 
behavioral recovery is perfectly correlated 
with the assimilation and systemic action 
of the drug, and in the second place, it is 
difficult to understand how the neural stimu- 
lant would cause a functional reorganization 
that is not permanent. 

Another reflexive behavior disrupted by 
neurological insult but apparently not des- 
troyed, lost, and recovered through the 
functional reorganization of neural centers 
is visual orienting. The phenomenon that 
demonstrates this was described by Sprague 
(1966) and ixvolves a unilateral lesion of 


> 
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the visual neocortex followed sometime 
later by destruction of the contralateral 
superior colliculus. Subsequent to unilateral 
extensive visual neodecortication (occipito- 
temporal cortex from splenial sulcus to 
rhinal fissure, or essentially the posterior 
half of the available cortex), cats are con- 
tralaterally hemianopic with regard to a 
number of visual behaviors, such as blinking 
in response to objects thrust into their field 
of vision, following moving objects, avoid- 
ance of obstacles, visual placing, and depth 
perception. However, the removal of the 
superior colliculus on the opposite side of the 
brain immediately restores visual functions 
to the hemianopic field. Here, as with the 
research of the Meyer group, the recovery 
of the visual behavior is difficult. to align 
with the concept of functional reorganiza- 
tion of neural centers, for similar reasons. 
Paramount among these reasons is the fact 
that the recovered behavior appears im- 
mediately following the subcortical lesion. 
Furthermore, it is difficult to understand 
how a subcortical lesion might spontane- 
ously reorganize neural centers to the extent 
that areas previously unable to support the 
visual behavior are now able to do so. Fol- 
lowing the arguments of the present report, 
a more plausible explanation of these re- 
sults would be that the visual behaviors 
Were recovered because the neural mechan- 
isms that control them survived the initial 
visual lesion, and the Second lesion simply 
enabled their expression, This, as is dis. 
cussed below, is quite similar to the inter- 
pretation offered by Sprague (1966) in his 
analysis of the data. 
Turning to more complex bel; 
side the visual realm, one m 
recovery of feeding behav 


pothalamic lesions, The lateral hypotha- 
lamic syndrome, as it is now called, sum- 
marizes the aphagia and adipsia which typic- 
ally result from bilateral destruction of the 
lateral hypothalamic nuclei, While the 
aphagia and adipsia are quite severe initially 
the lesioned animals may regain nearly 
normal feeding behavior if Provided with 
rather extensive postoperative care, Teitel- 
baum and Epstein (1962) have detailed 
with great precision the progress of this re- 


aviors out- 
ay consider the 
tor following hy- 
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covery, as the animal successfully progresses 
from tube feeding to accepting highly palat- 
able foods to eventually eating dried rat 
pellets and drinking on its own. Considering 
the time course and the required postopera- 
tive therapy, this is apparently a clear eR- 
ample of functional reorganization, where 
neural centers other than the lateral hy- 
pothalamus are operationally reprogramed 
to mediate the feeding behavior. The ob- 
servation that postnatal development of 
neonatal feeding may parallel this recovery 
process (Teitelbaum, Cheng, & Rozin, 1969) 
is further support for this contention. The 
evidence would, in fact, seem compelling, 
except for one peculiar problem reported by 
Powley and Keesey (1970). These re- 
searchers noted that if the rat was starved 
to 8076 of its body weight prior to the 
lateral hypothalamic ablation, there was no 
postoperative period of aphagia and adipsia. 
Moreover, if the animals were starved be- 
yond 80% body weight, the postlesion be- 
havior was one of hyperphagia. While 
Powley and Keesey attribute these results 
to a change in the animal's weight "set 
point," what is important for the position 
of the present paper is that it is difficult to 
align these results with the notions that the 
Tecovery of feeding behavior is a result of 
the functional Teorganization of neural tissue 
—unless one is Willing to suggest that 
starvation can precipitate functional reorgan- 


H H " . = 
ization. Again, behavioral recovery would 
appear to occur simply because the neural 
mechanisms cont 


1 rolling the behavior sur- 
Vived the neural lesion. 


The two-choice brightness discrimination, 
as described py Lashley (1935), may be 
used as the general paradigm to inquire if 
the arguments presented so far may gen- 


eralize to the recovery of learned behaviors. 
The general 


procedure requires than an 
animal, typically a rat, be trained in a two- 
choice discrimination box to avoid and/of 


escape footshock, 

differ in brightnes 
usually black anq 
white. The cues 
reflecting light off 
son-Bry 
lucent 


The discriminative cues 
S. with the darker cue 
the lighter cue usually 
are generated either DY 
painted panels (Thomp- 
an box) or by back lighting trans- 
panels (Yerkes box). Behavior? 
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training consists of a number of daily trials 
in the apparatus where the animal is allowed 
to avoid/escape shock by responding to the 
cue leading to a safe goal area. After the 
subject has mastered the discrimination to 
Some preset criterion, say 9 correct choices 
out of 10 trials, it is subjected to bilateral 
posterior neodecortication to include those 
Pria to the dorsal lateral geniculate 
R s, as described by Lashley (1934) 
— — mapped by Adams and 
Mi > (19065) and Montero (1973). 
order a SURE recovery period, on the 
fione! | oy toa month, the animal is 
loue on d the discrimination box and re- 
ness dis he preoperatively mastered bright- 

?5 discrimination. 
^ gu result of this retesting is that 
eiie p s initially perform at chance and 
trials tc approximately the same number of 
nation vi MA the brightness discrimi- 
the Sel ke THISIEU to master it during 
the eek preoperative training. Thus, 
liist "hes, illustrates that the learned 
dni Ligier discrimination may he 
also that i y striate neodecortication but 
"€ 1 is not beyond the reach. of the 
iem animal. The question of interest 
habit in M the recovery of the brightness 
ization dicates some sort of neural reorgan- 
Teactivatio! alternatively, the sparing and 
P ie rw neural mechanisms that 
tively, ae behavior pattern preopera- 
m us pio ication tha! the neural mech- 
may not * erlying discrimination behavior 
Comes fr e lost following a striate lesion 
hd M. "i p research of Braun, Meyer, 
studied a (1966). These investigators 
Bistamine possibility ‘that the drug am- 
in the ts which facilitates visual placing 
Sie cat, might also facilitate the 
Movin & n ut two-choice discriminatory be- 
gantly sin eye Their procedure was ele- 
ey (19355 e and followed the classic Lash- 
tant ex 3) paradigm, but with the impor- 
Rider that some of the rats were 
athe ee retrained under the influence 
receive ijs Peg: ssp bape other groups 
retraine a ee of saline or were simply 
livre a d. The results. indicated that the 

S amphetamine facilitated the reacquist- 


tion of the brightness discrimination, while 
saline injections had virtually no effect rela- 
tive to the control animals who simply re- 
learned the brightness discrimination. 

By itseli, this result is somewhat unim- 
pressive, since it is possible that the drug 
amphetamine might facilitate acquisition 
independent of any other condition of the 
experiment, However, a subsequent experi- 
ment reported by Braun et al. (1966) sug- 
gested that the facilitative effects of amphet- 
amine were dependent on whether or not the 
lesioned animal had learned the brightness 
discrimination preoperatively. In other 
words, the drug amphetamine was facilitative 
only because there was a previously estab- 
lished behavior pattern to facilitate. Since the 
drug was without effect if the animal had not 
had preoperative training. it would seem ob- 
vious that something of the preoperative 
brightness habit must survive striate neo- 
decortication. And should one argue that 
amphetamine really facilitates functional re- 
organization—not reactivation—there is the 
nagging question of why this facilitation of 
functional reorganization should occur only 
when the subject was preoperatively trained 
on the brightness discrimination. Again, it 
would appear that the postoperative cap- 
abilities of a subject are dependent on what 
is spared by the neural insult and not what 
is reorganized after the neural insult. 

However, as is often noted by the skepti- 
cal, there are two effects of drugs on be- 
havior, "those you are aware of and those 
you are not aware of.” To argue the case 
ior sparing as the process of behavioral 
recovery, it might be more convincing to 
restrict the experimental techniques to those 
involving only behavioral procedures. This 
was accomplished recently by LeVere and 
Morlock (1973). These researchers trained 
rats on a brightness discrimination task and 
subsequent to mastery subjected the animals 
to extensive ablations of the posterior neo- 
cortex. Subsequent to recovery, the lesioned 
animals were divided into groups, which 
were either retrained on the original bright- 
ness discrimination or retrained on the re- 
versal of the original brightness discrimina- 
tion, The argument was that if the neural 
traces of the preoperatively established. be- 
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havior survived the posterior neodecortica- 
tion, then retraining on the reversal of the 
original brightness discrimination should 
prove quite difficult, This is so because the 
spared neural mechanisms would be the 
exact antithesis of what is required post- 
operatively of the reversal group. 

The results indicated that the postopera- 
tive reversal group was indeed significantly 
impaired, and since this impairment could 
not be attributed to either differential bright- 
ness preferences (if such exist) or differ- 
ences in neural damage, LeVere and Mor- 
lock concluded that not only were the pre- 
Operative neural mechanisms spared, but 
they were also apparently influential in the 
postoperative recovery of the visually guided 
behavior, Subsequent research by these 
Same investigators (LeVere & Morlock, 
1974) has indicated that the result is not 
restricted to the simultaneous brightness 
discrimination paradigm but also occurs 
when rats are trained on a successive bright- 
ness discrimination task. And again, the 
data are incompatible with the concept that 
behavioral recovery is mediated by func- 
tional reorganization. The postoperative 
behavior patterns available to the individual 
are simply those behavior patterns that are 
dependent upon neural mechanisms spared 
by the neurological lesion, 

Thus it would appear that the ev 
suggests that behavior: 
neurological insult ma 
to the possibility 
mechanisms wi 


idence 
al recovery following 
y often be attributed 
that the underlying neural 
ere not lost in the first place, 


er which this conclusion ap. 
Pears to hold true would Suggest that it is 


quite general and not restricted to reflexes 
H . H H 4 
basic homeostatic behaviors, or learned pe. 


haviors, To maintain, on the other hand 
that behavioral recovery is the result of some 
unspecified, unique characteristic of neural 


tissue allowing for functional reorganization 
Is not just an unnecessary complication but 
rather, and more Seriously, a denial of the 
experimental data, Put another way, it is 
difficult to appreciate the need for the func- 
tional reorganization of neural Centers to 
enable the vicarious duplication. of neural 


mechanisms apparently not eliminated by 
neurological destruction, 


lost, 
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A Word on Learning 


It must be pointed out that the presan 
arguments do not deny or disallow functiona 
recovery through a process of learning. In 
fact, the possibility that learning may pre- 
cipitate functional recovery following neuro- 
logical insult is directly compatible with the 
notions of functional stability and sparing, 
as these concepts are developed in the pres- 
ent paper. In this instance, what is spared 
is not a specific neural mechanism and its 
associated behavior pattern but rather a 
process or capacity. The only rub is that it 
is difficult, in some definitional sense, to 
consider learning or relearning as a mecha- 
nism for the recovery of specific behavior 
patterns. Learning from the point of view 
of the individual is simply learning, whether 
it occurs prior to or subsequent to brain 
damage. Moreover, there is the distinct 


possibility that the behavior 


Ee ” 
recovered 
postoperatively 


might be quite different in 
detail from the preoperative behavior, be- 


cause, for example, the individual may be 
relearning on the basis of different environ- 
mental cues (Bauer & Cooper, 1964; Bland 
& Cooper, 1970; Cooper, Blochert, Gil. 
lespie, & Miller, 1972: Goodale & Cooper, 
1965). In this Case, the postoperative be- 
havior should surely not be considered a 
recovered behavior, since it did not exist 
preoperatively and thus, in point of fact, 
could not be recovered. To say that learn- 
mg subsequent to brain damage is behavioral 


recovery is to complicate the issue by em- 
phasizing the Outcome of a process, the at- 
tainment of some 


| behavior, rather than the 

Process itself, learning, 

Some Possrgrg MECHAN 
Recovery or 


If the above analysis has any merit, and 
it is granted th 


1 at the nervous system is 
Pest considered Within the conceptual frame- 
work of other biological systems, how is it 
paesible to explain recovery of function fol- 
owing i i i 

M brain damage? Obviously, it falls 
LUNG Short to simply say that if a 
Speci ehavi i d ic i 
pus - . ,I$ recovered following 

al ins chavior was never really 
lore concrete is needed, 


ISMS GOVERNING 
Function 


if 


1 
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but presently the data base does not allow 
any firm theoretical conclusions. However, 
there are three models that appear to have 
considerable merit and that are compatible 
with the notions of neural stability and 
Sparing. One of these models is concerned 
with the reaccessing of memory engrams and 
been recently formulated by Meyer 
oe a segond involves the inhibition 
e aet itation of surviving neural cen- 
rs by other spared neural centers 
Ine, 1966). The third is a reactiva- 
ley (1932 ye originally proposed by Lash- 
Loir es i 935) and recently revived by 
Sèrhans ane. Morlock: (1973, 1974). It is 
disc ps worthwhile to conclude the present 
cussion with a brief encapsulation of these 
tim Shute they subsume much of the 
lie y; y repe data and will most prob- 
we M Pi E n future inquiry into 
Bi lim ot function. For a full appreciation 
"s s propositions, the reader is urged 

S een the original sources. 
IE here notion. (Meyer, 1972) 
SHE ori e the fact that behavior depends 
individee kr engrams but also the 
&ngfam ia p jility to access these memory 
disraptiy s neatly argued by Meyer, the 
Biecipitat dem of neural damage could be 
tat ed either by the destruction of the 
Mass te itself or, alternatively, by 
dens. ns the ability of the individual to 
Nance e E Since, as pre- 
fbi Mer: ined, it appears that specific 
alten bwin se (memory engrams) are 
niii ie following neural lesions, the 
volved s mechanism or mechanisms 1n- 
n wo be pivotal in the recovery pro- 
Nei oreover, if it is accepted that the 
thie Ee is functionally stable, then 
pon ainda of the memory engrams 
result in p particular behavior would surely 
hope i a loss of the behavior, with little 
Sad d b is excluded, for func- 
the REAN ue However, to strictly align 
w yen concept with the current 
Sumed d unctional stability, it must be as- 
fhgram ne the accessing routes to memory 
ret are more diffuse than the memory 
i i themselves, W hile Meyer (1972) 
Pia ot make this specific assumption, it is 
a terribly unlikely assumption if one 


considers the spread of evoked electrical 
activity following localized brain stimulation. 

Another model compatible with the no- 
tions of stability and sparing has been pre- 
sented by Sprague (1966). This approach 
is somewhat more of a systems notion than 
that suggested by Meyer (1972) and sug- 
gests that various behavioral functions are 
mediated at cortical and subcortical levels 
by neural centers organized into systems. 
Neurological insult causes dysfunction be- 
cause the system is thrown out of balance by 
changing inhibitory/facilitory relations be- 
tween the various components of the dam- 
aged system. With regard to the visual 
system, perhaps Sprague's (1966) com- 
ments are most succinct : 


Visual attention. and perception are mediated at 
both forebrain and midbrain levels which interact 
in their control of visually guided behavior. Hemi- 
anopia caused by cortical lesion is due to an im- 
balance of these neural centers that subserve 
vision, resulting in an alternation of function at 
the midbrain level. (p. 1547) 


The final model, although obviously not 
the final word, is that originally suggested 
by Lashley (1932, 1935). Descriptively, 
the position may be summarized as a re- 
activation conceptualization where, similarly 
to Sprague, the mammalian nervous system 
is considered to be organized into a number 
of separate subsystems. Neural lesions 
cause behavioral degradation, because the 
individual tends to maximize its interaction 
with the environment and to utilize only 
those neurological systems functioning most 
appropriately. This then shifts the indi- 
vidual's behaviors away from those depend- 
ent on the damaged system and toward 
those dependent on the undamaged systems. 
For example, behavioral control may shift 
from visual cues to olfactory cues following 
striate neodecortication, with the result that 
visually guided behaviors would suffer. Be- 
havioral recovery in this situation. involves 
reactivating the damaged neural system by 
forcing its utilization. to guide behavior. 
When this is accomplished, those behaviors 
that can be supported by the spared portions 
of the damaged system should return, and 
those behaviors that cannot be supported 
should not return. A specific example of 
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the operation of this sort of mechanism is 
provided by the “cross-suture technique, 
which may demonstrate certain residual vis- 
ual functions following early monocular 
deprivation in kittens (Ganz & Haffner, 
1974). 


SUMMARY 


Whatever model of behavioral recovery is 
ultimately developed, it would seem clear 
that it must remain within a certain opera- 
tional framework, The postulate of the pres- 
ent discussion is that this operational frame- 
work is that which is characteristic of 
biological systems in general and encom- 
passes functional stability and postoperative 
behavior dependent on the continued opera- 
tion of what is spared. Conceptualizations 
of behavioral recovery suggesting other 
vaguely specified characteristics, such as 
functional reorganization, should, at least 
for the present, be avoided. The reason for 
doing so is not simply a concession to par- 


simony but rather a respect for empirical 
findings. 
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motivated behaviors, 
stressors, is evaluated. 
associative processes, 


]t is suggested 
nonassociative factors, 
neurochemical changes, determine avoidance response rate. 


The role of acetylcholine and central catecholamines in modulating aversively 
and behavior following exposure to uncontrollable 


] that in the presence of adequate 
mediated by stress-induced 
Moreover, in 


addition to the existence of a balanced state between excitatory catecholamine 


and inhibitory cholinergic systems, 


systems may be mutually regulatory. 


it is posited that these neurochemical 
Excessive stimulation of one system 


may induce a compensatory antagonistic rebound in the complementary sys- 


tem, thereby maintaining 


neurochemical 


homeostasis. Owing to time- 


dependent variations in neurotransmitter activity, temporal variations in per- 


formance may occur following 


jnitial exposure to aversive stimulation. 
Alterations in neurochemical activity that affect nonassociative processes have 
predictable effects on time-dependent v 


ariations in avoidance performance. 


The model is extended to deal with other stress related phenomena, for 
example, helplessness, depression, and ulceration. 


1 e has become increasingly apparent that 
ther ake motivated behaviors, as well as 
(s Pide: involving response inhibition 
Asa nabituation, responsivity to nonrein- 
win stimuli, shock-induced response sup- 
ome are modulated largely by the ac- 
ae a of acetylcholine, catecholamines, and 
a ok. reviews by Bignami, Rosic, 
1963 le cone & Gatti, 1975; Carlton, 
Becks & rd Carlton & Markiewicz, 1971; 
oes tolk, 1970; Stein & Wise, 1974). 

ato ver, although modification of levels of 
found ot these transmitters may have pro- 
den effects on behavior quite indepen- 
Eaa of other transmitter systems, ample 
rn are available indicating that these sys- 
(C act in a reciprocally coordinated fashion 
N arlton, 1963; Chase, 1974; Costall & 
Naylor, 1974; Gellhorn, 1957). Essentially, 
ne by Grant A9845 from the National 

tended is pum ot Canada. Ae be ex- 
Wahlsten ium doe ia ped bag io E: AS 
emington ie K kin is I i Hoff aad 

h Ster Pied ay Kol inidis, Rol ert o man = 
1 draft of this or critical comments on an ear 1er 

s paper. 

aS ae reprints should be sent to Hymie 
iesi epartment of. Psychology, Carleton 
y, Ottawa, Ontario, K1S SB6, Canada. 


the suggestion has been that the catechol- 
amines, whose behavioral concomitants in- 
volve excitation, act antagonistically toward 
the cholinergic and serotonergic systems that 
exert an inhibitory influence upon ongoing 
behaviors. 

To date, the notion of reciprocally antago- 
nistic systems has faired well in accounting 
for phenomena such as drug interactions 
(eg., Carlton, 1963; McGeer, Grewaal, & 
McGeer, 1974; Swonger & Rech, 1972), 
ontogenic variations in inhibitory processes 
( Campbell, Lytle, & Fibiger, 1969; Camp- 
bell & Mabry, 1973), parkinsonism (Barth- 
olini, Stadler, & Lloyd, 1973), stereo- 
typic behaviors (Arnfred & Randrup, 1968; 
Janowsky, El-Yousef, Davis, & Sekerke, 
1972), and factors involved in habituation 
(Carlton, 1968, 1969). Surprisingly though, 
a paucity of data exists regarding the role 
one neurotransmitter system may have in 
modulating the effects, and indeed in alter- 
ing the actual levels, of a second system and 
how such interactions might ultimately af- 
fect performance levels. There is reason to 
believe that when a particular transmitter 
system is excited excessively, an antagonistic 
rebound may be induced in a second system, 
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thereby maintaining balance between ex- 
citatory and inhibitory processes (Agid, 
Guyunet, Javoy, Beaujouan, & Glowinski, 
1974; Bartholini et al, 1973; Gellhorn, 
1957; Manto, 1967; McGeer et al., 1974; 
Richardson, 1974; Sethy & Van Woert, 
1974). 

In the present paper the balance between 
transmitter systems and the possibility of 
their reciprocal innervation are evaluated 
with respect to effects on aversively moti- 
vated behaviors in general and to the effects 
of prior aversive stimulation on subsequent 
behaviors in particular. In the sections that 
follow, it is shown that performance in aver- 
sively motivated tasks, although invariably 
dependent on associative processes, are 
modified by nonassociative events. Because 

_the temporal effects of stress are not neces- 

sarily monotonic, an extended discussion of 
time-dependent variations in performance is 
presented. Finally, neurochemical concomi- 
tants of performance following the inception 
of a stressor are discussed. These neuro- 
chemical events are proposed to mediate 


nonassociative events that affect avoidance 
performance, 


Time-Dependent Changes in Aversively 
Motivated Behaviors 


It has been demonstrated repeatedly that 
following exposure to signaled inescapable 
shock or only a few escape/avoidance trials, 
subsequent active avoidance performance 
progresses more slowly after an intermediate 
retention interval (1-6 hours) than after a 
short (less than 10 minutes) or long (24 
hours) interval (Brush, 1971; Kamin, 
1963). The U-shaped performance func- 
tion is of some interest, because retention 
functions for most associative learning situ- 
ations are monotonic and negatively deceler- 
ating, thus making the avoidance retention 
function somewhat paradoxical in nature, 

While several hypotheses have been for- 
mulated to elucidate the mechanisms sub- 
serving the time-dependent changes in be- 
havior, none of these positions adequately 
account for the available data (cf. Anisman 
1973a; Barrett, Leith, & Ray, 1973: for c 
review of most of the current hypotheses 
see Brush, 1971). Though critiques of some 
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of these hypotheses are advanced later in 
this paper, it is important to note that the 
alternative. positions. expounded by Brush 
(1971), involving availability of corticoster- 
one and coping abilities, and by Klein and 
Spear (1970a, 1970b), dealing with memory 
retrieval, are not necessarily contrary to the 
position advanced in the present paper. 
Essentially, the thesis of the current pro- 
posal is that initial exposure to stressful 
stimulation has both associative and non- 
associative consequences, which interact to 
determine the rate at which a subsequent 
avoidance response will be acquired. 
Whereas changes in associative processe: 
are thought to be minimal over the initial 
24-hour period, nonassociative changes elici- 
ted by stress are not monotonic. Ultimately, 
in the presence of adequate associative abili- 
ties, nonassociative changes may determine 
the form of the retention function. 
'To be more explicit, in order for 
avoidance response to be 
components are necessary, 
must be established, 
Pavlovian processes 
mon, 1967) or expect 


an active 
acquired, several 
A drive state 
defined in terms of 
(cf. Rescorla & Solo- 
ancy of shock (Bolles, 
1970, 1971) ; the animal must learn cen 
to initiate an active Tesponse, appropriately 
direct this response (Anisman & Wahlsten, 
1974), and be able to initiate the active 
Ocomotor response (cf, Anisman & Waller, 
1973a). While it is clear that associative 
factors ultimately set rate limitations on 
avoidance acquisition, the role of nonassoci- 
ative factors in determining response rate 
only recently has become a major source of 
debate with respect to aversively motivated 
behaviors (cf. Bolles, 1970, 1971 ; Seligman, 
1970). In the typical active avoidance situ- 
ation, it has been observed that the rate of 
acquiring the avoidance response was in- 
versely related to the Strength of an incom- 
patible freezing Tesponse produced by shock 
and directly related to the rate at which this 
freezing response could be suppressed ( Anis- 
€ Waller, 1972, 1973a; Bolles, 1970). 
psn of es iet dat in he 
mafibalae late associative abilities. 
es mw that increase freezing behavior 
id ri S pe disrupt pertormanes, 
ose that decrease inhibitory ten- 


— 


— Sicil. 


Nox 


dencies wi 
ative abilities E mne of adequate associ- 
in csse it is unlikely that reduction 
avoidance bel we ENTERS 191 shock will alter 
1975. in ee substantially ( Anisman, 
Kokkinidis, s igen Wahlsten, & 
time-dependei rie With respect to the 
suggestion a changes in avoidance, the 
interval, end been that at an intermediate 
previously —— response contingencies 
ever, animals ablished are present. How- 
quately the pese less able to initiate ade- 
Successful avoi afin response necessary for 
ormance at in ge and. as a result per- 
Seen at brief Ms interval is inferior to that 
locomotor or long intervals, when the 
Somewhat mens can be initiated with 
eman 3 * difficulty ( Anisman. 1973a; 
Waller 1971 mokinnitis, 1974; Anisman & 
1972: Pinel i eae Leith, & Ray. 19712. 
Consider be Mucha, 1973a, 1973b). 
latativern Toon support has accrued for a 
Sequences bes m the nonassociative con- 
derived tes shock. Phis support has been 
response-re ma studies aimed at examining 
posure to Lin changes following €x- 
Tom studi PEMSETIME shock, as well as 
tingencies eS) involving response-shock con- 
ch owe gc and passive avoidance). 
iige air , P exposure to noncon- 
levels. ar odd behavior is characterized by 
mar, 1970: p that exceed baseline (Ku- 
1971: Pi : aeos Corcoran, & Malsbury, 
period v & Mischa. 1973. 1973b). The 
within 10 excitation is transient, subsiding 
ine SUI iminutes and falling below base- 
acterized t geil during the phase char- 
Misses ride response suppression is in- 
hyperacti a to shock intensity, the initial 
itens mig directly related to shock 
Within - nisman & Waller, 1973b). 
tivity do Es after shock exposure ac- 
The Esai: 4 a and approach baseline. 
is een: rate at which activity increases 
aede dem on several factors. such as 
aud the nsity ( Anisman & Waller. 197 3b) 
i585 cb of animal employed ( Anis- 
, 1973a). 
i In esperiments involving : 
ponse-shock contingencies, such as condi- 
tioned emotional response (CER) and pas- 
sive avoidance tasks, it was observed that at 


specific re- 
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short periods after initial shock there is a 
phase of hypermotility. which results in à 
weak CER and poor avoidance performance. 
Zetween the brief and intermediate interval 
there is an increased CER and improved 
passive avoidance performance (Anisman & 
b rem i Geller, Jaryik, & Robustelli, 
; Irwin, Banuazizi, Kalsner, & Curtis, 
1968; Kumar, 1970; McMichael, 1966; 
Robustelli, Geller, & Jarvik, 1970; Tarpy, 
1966; Zammit-Montebello, Black, Marquis, 
& Suboski, 1969). Evidently, although the 
pairing of a stressor with an explicit cue has 
profound associative consequences, the initial 
shock-induced excitation and subsequent re- 
suppression may modify perform- 
That is, during the brief interval 
the response bias created is in- 
th passive avoidance, whereas 
ediate interval, the shock-in- 
ssive avoid- 


sponse 
ance. 
after shock, 
compatible wi 
at the interm 
duced response bias favors pa 
ance and response suppression. 
Owing to the nature of the passive avoid- 
ance task, it is difficult to divorce associative 
ociative factors that contribute to 
the ultimate levels of performance. However. 
when considered together with active avoid- 
ce, the influence of nonasso- 
becomes apparent. For ex- 
ample. in à series of experiments conducted 
by Barrett and his associates (Barrett, 
Leith, & Ray. 197 1b, 1972; Steranka & Bar- 
rett, 1973, 1974). it was observed that fol- 
lowing initial stress both activity levels and 
Y-maze avoidance performance resembled 
U-shaped functions and that activity levels 
correlated directly with avoidance response 
In contrast, discrimination perform- 
the Y-maze avoidance/escape task 
did not vary over the various retention in- 
tervals. Evidently, animals “know where to 
run” (i.e., the memory of the safe and dan- 


gerous locales is intact), but at the inter- 
mediate retention interval animals suffer 
from an inability to initiate the active loco- 
response necessary for successíu 


and nonass 


ance performan 
ciative factors 


rate. 
ance in 


motor 
avoidance. 

In support of such an 
Anisman (19732) observed that among 
Holtzman rats 3 U-shaped performance 
function was observed in à shuttle task fol- 
lowing initial exposure to signaled inescap- 


interpretation, 
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able shock. However, if animals were tested 
in a one-way avoidance task the U-shaped 
performance function was not observed. dt 
seems that under those conditions in which 
freezing cannot be Suppressed readily, as in 
the shuttle task, differences in freezing be- 
havior effectively modulate avoidance re- 
Sponse rate. In contrast, when freezing be- 
havior can be suppressed in relatively few 
trials, as in the case of the one-way avoid- 
ance task, the nonassociative effects elicited 
by initial shock exposure have only very 
minor effects on avoidance performance. 
Together with the data i 


of response Suppres- 
nd may inter- 
an active avoidance task. In 
contrast, at the brief interva], the presence 
of response excitation fay 


Ty assumptions of a 
nonassociative explanation js that the U- 


shaped function is a response to shock per 
ck pairings. Thus at 
iate intervals the ex- 
of activity, respec- 
to modify behavior 

With the attenu- 


and intermed 
citation and inhibition 
tively, would be expected 
in a predictable manner, 
ation of response suppression at the long 
interval, response rate would be primarily 
an indicant of associative processes and 
species-typical defensive behaviors, As in- 
dicated earlier, whether variations i 

i ill be reflected in re 
change (eg. improved active &voidance) 
depends on a variety of associative factors, 
Brush (1966) observed that unlike signaled 
shock, initial expost d shock 
does not promote a U-shaped function, 
Presumably, if the stimulus~shock contin- 
gencies are not well established during initial 
training, then potential differences in per- 
formance due to nonassociatiye factors may 
not be manifested, F, under 
conditions in which animals 


ire to unsignale 
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duced when animals subsequently are re- 
quired to avoid shock by running from zi 
illuminated (i.e, by the conditioned i enl 
lus) to a nonilluminated chamber (Anis 
man, 1973b), Accordingly, subsequent per 
formance at all intervals initially will be Ter 
tarded, and differences in freezing behavior 
at the various intervals will not be detect- 
able. 

It should be considered further that when 
the contingencies of the avoidance task are 
ambiguous even under the best of conditions 
(eg., as in shuttle avoidance), the low ien 
of performance typically observed may adq 
to the obfuscation of the effects of the non- 
associative factors. Moreover, since tasks 
Such as shuttle avoid 


ticularly dependent (cf. 


Anisman & Waller, 1973a; Bolles, 1971), 


the initial establishment of the stimulus- 


n of the U-shaped 
avoidance testing 
mna one-way 
typically is not plagued by 
formance. i ^ 

shuttle task, then 
is observed 
ed stimulus 
ng initial train- 


even when 
is not paired with shock duri 
ing (Anisman & Waller, 1 1; Bauer, 1973; 
de Toledo & Black, 1970). More recently, 
'ghes (1975) also re- 
aped performance func- 
t observed in a one-way task follow- 
Mg exposure to inescapable unsignaled 


Shock in a compartment radically different 
from that used during avoidance training. 


€ finding that the U-shaped 


after 
the Nonassociatiy 
tenable. 

The intera, 
Shock pairin 
function has 


ction between shock and cue- 
gs in determining the U-shaped 
been elegantly shown in experi- 
Ments carried by Pinel and Mucha 
(1973a, 1973p and by Barrett et al. 
Pinel and Mind 
€ administration o 
the period when s 
resulted in an initial excita 


J. 
( 19734) Observed that th 
a Second Shock during 
tivity was low 
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tion and subsequent suppression of loco- 
motor activity comparable to that seen after 
the initial shock session. In effect, the sug- 
gestion made by these investigators was that 
time since the last shock and not time since 
initial training determined nonassociative 
processes. Similarly, Barrett et al. (1971b) 
reported that if animals initially received 
avoidance training involving clearly dis- 
criminable cues, thus establishing the con- 
i is of the avoidance task, and 24 
ecd ae subjected to unsignaled in- 
Eus e Shed subsequent avoidance per- 
ance at various intervals following m- 

T RAUM shock resembled the typical U- 
EROR This particular experi- 
ree! ustrates again that the U-shaped 
unction is dependent on time since the most 
mune stress and not on time since original 
eg Moreover, if the task was clearly 
discriminable during original training, then 
"um a; function was observed following 

signaled exposure to shock. 

Wood age taken together, seem to 
Mie ge the simple presence of non- 
ex I changes following shock will not 
A the occurrence of a U-shaped func- 
tion. The rate-limiting step must be one of 
adequate associative properties being pres- 
i SMOD as the interaction. between 
kae d eae factors and associative proc- 
dive e ermines performance, and nonasso- 
ct aT time dependent, this 
inem d constitutes an ideal situation for 
medi g the tole of biochemical events that 
situan e nonassociative processes from those 
A ons in which biochemical changes oc- 
r and response contingencies are varied. 


Ti — s KF : 
: Cp ees ale Variations in Newrochem- 
al Activity and Avoidance 


ane initial exposure to shock has 
Pi sq d consequences that influence 
tin ersten behavior. What remains to be 
depend sich is what factors mediate the time- 
P we ariations of these nonassociative 

. At an intuitive level it would ap- 
pear most parsimonious to assume that 
reezing behavior, and its underlying mecha- 
nism, should be maximal soon after stress 
and should dissipate thereafter. However, 
if such a model is accepted, a second factor 


must be invoked to account for the absence 
of freezing behavior and the high level of 
performance observed at the brief interval. 
One possibility is that shock elicits transient 
biochemical changes that retard or antago- 
nize the course of response inhibition that 
might otherwise occur. In effect, it is as- 
sumed that mechanisms that produce re- 
sponse suppression are at least partially 
operative, but the stimulation of a second 
system masks these suppressive effects. At 
this point, further biochemical mechanisms 
can be introduced. These are assumed to 
be more sluggish in nature and may be a 
consequence of the initial biochemical 
change. The maximal effects of these 
changes are thought to occur at an inter- 
mediate interval, and these tend to exacer- 
bate response inhibition. Thus, the U- 
shaped function could well be due to inter- 
action between the initial biochemically in- 
duced behavioral excitation and the some- 
what postponed behavioral inhibition elicited 
by further chemical alteration. At relatively 
long intervals following the initial stress, 
chemical levels should approach a base state, 
and consequently, performance at this in- 
terval would more closely approximate the 
uncontaminated associative effects of the 
initial training. 

Recently, the nonassociative effects of 
shock have been attributed to variations of 
several neurotransmitters ( Anisman, 1973a; 
Anisman & Kokkinidis, 1974; Barrett, Leith, 
& Ray, 1971b; Manto, 1967; Thompson & 
Neilson, 1972; Brett, Note 1). Together, 
these studies have yielded an integrated 
model involving the interaction and mutual 
innervation between several neurotransmitter 
systems. Specifically, as depicted in Figure 
l, exposure to shock is thought to elicit a 
transient cathecholamine (dopamine and 
norepinephrine) action, followed by a cho- 
linergic rebound together with subsequent 
decreased catecholamine activity (cf. Anis- 
man, 1973a; Anisman & Kokkinidis, 1974; 
Barrett et al., 1971b: Manto. 1967 ; Thomp- 
son & Neilson, 1972; Brett. Note 1). The 
initial catecholamine action is thought to re- 
sult in behavioral excitation (Costa & 
Groppetti, 1970; Fuxe & Ungerstedt, 1970) 
and consequently proficient avoidance be- 
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INHIBITORY CHOLINERGIC 
REBOUND 


EXCITATORY 
CATECHOL AMINE. 
ACTION 


RESPONSE STRENGTH 


RESULTANT AVOIDANCE 


BEHAVIOR 


INHIBITORY 
CATECHOL AMINE 
DEPLETION 


b 


INTERMEDIATE 


TIME AFTER 


LONG 
INITIAL STRESS 


cholinergic 


idance curve. Note that immediately 


S, catecholamine activity shows a transient 


increase and then falls below baseline, 


Cholinergic action increases steadily, 


reaching maximum at the intermediate interval, 


havior (Breese, Cooper, 
Rech & Stolk, 1970). The catecholamine 
action in turn results in a compensatory 
cholinergic rebound, which produces þe- 
havioral inhibition (cf. Carlton, 1969) and 
poor avoidance performance (Anisman, 
1973a; Anisman & Kokkinidis, 1974; Big- 
nami & Gatti, 1966). Moreover, following 
the initial catecholamine excitation, cate- 
cholamine activity declines, thus €xacerbat- 
ing the inhibitory effects of the cholinergic 
reaction. As seen in Figure 1, unlike the 
transient excitatory catecholamine action, the 
cholinergic reaction is more sluggish and 
longer lasting. During those Periods when 
the balance between the two systems favors 
cholinergic action, response initiation js in- 
hibited and avoidance progresses slowly 
whereas the converse i 
when the balance rests in favor of catechol 
mine activity. 

The data in support of such 
derived from 


& Smith, 1973 ; 


a model are 


several sources, In the sec- 


tions that follow pertinent dat 


indicating that (a) balance 
Innervation between 


a are presented 
and reciprocal 

Neurotransmitter Sys- 
tems may occur in both the central and 


Peripheral nervous system, (b) exposure to 
a stressor has Profound effe 
(c) 


ity of severa] 
Pharmacological manipulations have predict- 


he effect roidance in general and on 
the U-shaped function jn particular, More- 
Armacologica] effects are due 


ciative as opposed 


j ; 
Balance and Reciprocal Innervation of 
N urotransinitters 


The notion of reciprocally activated Sys 
tems is no i in the literature. Por 
example, į documentation of 
parasympathetic activity, 
indicated that not only 
PNISÜ between the adrenergic 

Systems, but excitation of 


NoxassociaTIVE [NFLUENC 


one system may result in the activation of 
the complementary system. Specifically, 
sympathetic stimulation may lead to a de- 
eed pingi as well as an increased 
dut o i parasympathetic _ stimulation. 
i eis bos parasyinpedhene discharge 
"ber mate with further parasympathetic 
npa hetia & be suppressed by concurrent 
reportel da stimulation. Gellhorn further 
a T are of the hypothala- 
sión E e icit sympathetic signs: however, 
daton pesca of the hypothalamic stim- 
obseva. be cpi responsivity was 
aineal m arly, if epinephrine was ad- 
terior By dd to; stimulation of the pos- 
rdan x: amus, sympathetic responses 
ition a enored by the hypothalamic stim- 
SÉ: Lis eliminated. On the basis of 
RAK RY servations, it was suggested that the 
Ls UAM m reaction is directly related 
ede (intensity, frequency, or 
Brie ae o Nu preceding sympathetic 
i ds inca he function of the reciprocity 
liim. th ain neurochemical homeostasis 
konanie VE amagoninhe portions of the 
uidens. sy stem, Moreover, the rebound 
the E 1 is mediated by baroreceptors of 
tiim i sinus and aortic arch, since tran- 
Resi wisi edo connecting the barorecep- 
nk = me medulla prevents the occur- 
E rebona effects. While such a 
mimm sm pe possibly account for sym- 
Wülikely Fe parasympathetic balance, it is 
Fie me is same mechanism accounts 
peine. y motivated behaviors, since 
lap a y acting agents typically have not 
ngen obser ved to produce large changes in 
avoidance behavior. | : 

Neto has been observed that sym- 
colier peter may result in a central 
een an rebound similar to that stipu- 
sibel Beuker (1957) regarding para- 
lalce E epic and in addition, the 
tion seo E from the central innerva- 
Silvie by ulate behaviors such as general 
sally Fae av oidance behavior. Specifi- 
affen fou (1967) observed that 1 hour 
rod a ction of epinephrine there is a de- 
UNE general activity. The decline in 
activity was reversed with scopolamine but 
nab by the peripheral analogue methylscopol- 
amine, suggesting that peripheral adrenergic 
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stimulation may result in an inhibitory cen- 
tral cholinergic rebound or that parasym- 
pathetic responsivity is mediated by central 
mechanisms. Remington and Anisman 
(1974) also have observed that the disrup- 
tive effects of epinephrine on avoidance be- 
havior could be reversed by scopolamine. 
Inasmuch as phentolamine, an alpha recep- 
tor antagonist of the adrenergic system, at- 
tenuated the epinephrine-induced decrement 
in activity, Manto (1968) suggested that 
stimulation of alpha receptors, as well as 
beta receptors as suggested by Gellhorn, 
were necessary for the rebound effect to 
occur. This particular notion was supported 
further by the finding that epinephrine, 
which excites both alpha and beta receptors, 
was more effective in the induction of a 
cholinergic rebound than norepinephrine, 
which excites alpha receptors. 

In a pseudoconditioning barrier-crossing 
task, Izquierdo (1974) observed that per- 
formance was enhanced by amphetamine. 
Adrenergic blocking drugs, administered in 
doses that had no effects on their own, suc- 
cessfully antagonized the amphetamine ef- 
fects. Conversely, the peripheral anticho- 
linergic, methylatropine, although producing 
negligible changes in behavior, potentiated 
the effects of amphetamine. Accordingly, it 
was suggested that although amphetamine 
exerts an excitatory effect on behavior, it 
also induces a parasympathetic rebound that 
detracts from the stimulant properties of 
amphetamine. Thus, with cholinergic block- 
ade the inhibitory rebound is dampened and 
behavioral excitation is augmented. Paren- 
thetically, the position. that amphetamine 
exerts both excitatory adrenergic and in- 
hibitory cholinergic effects may be useful in 
accounting for the synergistic action of am- 
shetamine and anticholinergics (eg. Carl- 
ton, 1961a, 1961b, 1963: Carlton & Didamo, 
1961). That is. scopolamine or atropine in 
doses that do not produce behavioral dis- 
inhibition may be sufficient. to antagonize 
the inhibition produced by amphetamine, 
thereby eliciting increased activity. 

Hitherto unpublished data collected in our 
laboratory (Anisman & Kokkinidis, Note 2) 
have yielded results similar to those reported 
by Izquierdo (1974). Specifically, although 
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large doses of d-amphetamine (5 mg/kg) 
produce locomotor excitation in mice, lower 
doses (.5 or 1.0 mg/kg) induce behavioral 
suppression. The peripherally acting agent, 
p-hydroxyamphetamine, administered in 
equimolar doses did not affect activity, thus 
suggesting that both the excitatory and in- 
hibitory effects of d-amphetamine were 
mediated at a central level, If animals were 
pretreated with scopolamine, the depressed 
behavior produced by d-amphetamine was 
not observed, whereas the peripheral choli- 
nergic agent, methylscopolamine, did not 
affect behavior, Thus it seems that the in- 
hibitory effects of d-amphetamine are medi- 
ated by central activity and possibly involve 
cholinergic action, Tt is likely that in 
addition to resulting in increased release and 
J Synthesis of catecholamines, a cholinergic re- 
bound producing behavioral Suppression was 
also elicited (e.g., Mandell & Knapp, 1971). 
With high doses of d-amphetamine the bal- 
ance between catecholamine and acetylcho- 
line activity rests in favor of the catechola- 
mine system, thus producing behavioral 
excitation. With low doses of d-ampheta- 
mine, the balance rests in favor of central 
cholinergic activity and consequently is 
manifested by behavioral inhibition, 

A similar model may possibly hold for 
some of the paradoxical behavioral effects 
elicited by L-dopa. Specifically, it is well 
known that L-dopa in high doses, when ad- 

„ministered with an extracerebral decar- 
boxylase inhibitor, results in increased levels 
of activity. Presumably, this is a result of 
the increased level of dopamine ( Carlsson, 
1970; Fuxe & Ungerstedt, 1970). Para. 
doxically, however, low doses of L-dopa re- 
sult in the retardation of activity (Butcher & 
Engel, 1969), While Peripheral factors may 
contribute to the decline in activity, it is 
clear that central mechanisms are also in- 
volved (Stromberg, 1970). It js likely that 
increases in dopamine result in a cholinergic 
rebound (eg., Sethy & Van Woert, 1973). 

With high doses of L-dopa the balance he- 

tween the two Systems rests on the dopa- 
minergic side and results 

behavior. With low doses the | 

in favor of cholinergic actie 
creased activity is observed, 


valance rests 
m, and thus de- 
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In the investigation of yet another be 
havioral phenomenon, namely that of ster ie 
typic behaviors elicited by aatectolamiine 
stimulant drugs, it has been observed T 
peatedly that adminstration of ae 
agonists and anticholinerestrase agents elimi 
nate or reduce stereotypy, whereas an 
muscarinics augment stereotypy (Arnfred & 
Randrup, 1968; Costall & Naylor, 1972; 
Fjalland & Moller-Nielson, 1974; Janowsky 
et al., 1972; Pedersen, 1967 ; Scheel-Kruger, 
1970). Since Stereotypy appears to be me- 
diated largely by dopamine ( Randrup & 
Munkvad, 1968, 1970), it has been posited 
that reciprocal innervation may exist be- 
tween the dopaminergic and cholinergic sys- 
tems. Further to this notion, it has been 
Teported that inhibition of dopamine syn- 
thesis reduces the effects of anticholinergics 
(Thornburg & Moore, 1973), while dopa- 
mine stimulants are potentiated wih anti- 
cholinergics (Arnfred & Randrup, 1968). 
Costall and Olley (1971) have shown that 
cataleptic behavior induced by the choliner- 
gic agent, arecoline, can be potentiated by 
lesion in the caudate putamen, and reduced 
by lesion of the substantia nigra, Tt js likely 
that the substantia nigra is the site of action 
of the cholinergic drug, and feedback exists 
between this site and the striatum ja the 
dopaminergic nigrostriatal system, 


y Thus, 
excitation of one transmitter may cause re- 
bound effects inas 


a second transmitter system. 
Indeed, 


eed, it has been reported (Ahtee & 
Kääriäinen, 1974) that 


Pilocarpine was accompanied by increased 


: acid, the metabolite of dopa- 

Out an increase in brain dopamine 
per se, Suggesting that the cholinergic stim- 
ulant produces increased metabolism of 
dopamine, It is noteworthy that these au- 
thors indicated that although the cholinergic 
and dopaminergic Systems are antagonistic 
and can compensate for the activity of one 
another, this j nly to a certain extent. 
able data, it appears that 
ation may promote a cho- 
and thus modify behavior. 
and Cooper (1971) have re- 
al data Suggesting that cho- 
> ation may also result in à! 
1€ rebound, They observed that al- 


S true o 

Given the avail 
adrenergic stimul 
linergic rebound 
Fibiger, Lynch, 
Ported addition 
Mergic stimul 
adrenerg 


r 
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D pie a cholinomimetic, initi- 
“4 B = behavioral inhibition, a subse- 
La aps behavioral excitation also 
lerine d ) hile pretreatment with scopo- 
unus Eon ee both the inhibitory and 
ne BN erai e ape of pilocarpine, scopolamine 
diim de with the onset of the hyper- 
^» phase potentiated the excitation. 

Scar bye the psychopharmacological 
PC en between systems 
Vioc at h nly recently, however, have 
neee fee become available that have 
ally d that systems are in fact mutu- 
observed th Migs Specifically, it has been 
tylcholine idee sinn associated with ace- 
jection uf (4 hd are increased following in- 
viorepbie Sateen: or such as 
uu PEE co 
ud dm n (Sethy & Van Woert, 1973) 
1971) ia hetamine (Mandel & Knapp, 
a is acne levels of catecholamines 
Seke e th yc Men on the other hand, 
inr nad A vc) activity of enzymes as- 
& ien "e (Benton, Moffatt, Rick, 
suggested m 2 de Recently it has been 
Consolo, Bi gid et al., 1974: Ladins 
Sethy & van Saminin, & Ghezzi, 1975 
an V den 

atum, eise qi. — iota 
role on ACh 2 a onic in ul pitory 
agents in ^ Dopamine stimulating 
decrees mag ACh levels. presumably by 
opal A the firing rate of ACh neurons. 
SEM pr dopamine blocking agents de- 
leasa ul er via an increase in the re- 
intireased: h. As might be predicted, the 
blocking n of ACh by dopamine- 
almus Cm is reversed by dopamine 
Lied & M Bartholini et al., 1973; Stadler, 
A idis Bartholini, 1974; Stadler, Lloyd, 
pee ae & Bartholini, 1973). However 
e d Aran and Bartholini (1973) have 
nina M although a balance of this 
ight in o in the striatum, this does not 
wilbetendi or limbic structures. This not- 
Seamer gel ing, it 18 apparent that reciprocal 
Mores uon between systems does exist. 
havi over, these data suggest that some be- 
naviors, such as tremorigenic effects due to 
might well involve 


low dopamine levels, 
acetylcholine as well. 
" From the side of manipulating acetylcho- 
ine levels and gauging the change in dopa- 
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been observed (Javoy, Agid, 
Bouvet, & Glowinski, 1974) that carbachol 
infusion into the substantia nigra reduced 
the utilization of newly formed dopamine in 
the neostriatum, but had no effect on syn- 
thesis. Atropine infusion, on the other hand, 
stimulated synthesis and utilization of dopa- 
Hitzemann, Loh, and Domino (1972) 
scopolamine treatment de- 
14C-dopamine 


mine, it has 


mine. 
reported that 
creased the accumulation of 
and 1C-norepinephrine in the mouse brain 
after administration of either ?'C-tyrosine Or 
1C-dopa. Similarly, several investigators 
have reported that anticholinergics antagonize 
increases in homovanillic acid induced by 
neuroleptics (Anden, 1972; Bowers & Roth, 
1972; O'Keefe, Sharman, & Vogt, 1970) 
and also induced modification of dopamine 
turnover (Anden & Bedard, 1971; Corrodi, 
Fuxe, & Lidbrink, 1972). Finally, Herman, 
Kmieciak-Kolada, Slominska-Zurek, and 
Szkilnik (1972) observed that intravenous 
ACh decreased levels of nore- 
cortex, mesencephalon, pons, 
while Kazic (1973) reported 
of physo- 


injection of 
pinephrine in 
and medulla, 


that intravenous administration 


stigmine markedly increased synthesis and 
pothala- 


turnover of norepinephrine in the hy 
mus. Taken together, these data indicate 
that modification of either acetylcholine of 
alia, will also result in à 


dopamine, inter 
change in transmitter level in the comple- 
mentary system. It seems that not only do 


cal activation OC- 


balances exist, but recipro : 
homeostatic 


curs and probably serves in à 


capacity. 
Effects of Stress ov 


Activity 

The position of the present pa 
modification of neurotransnitters may ac- 
count for response changes following €XPO- 


sure to an initial stress situation. That 1s, 
initial stress may result in increased cate- 


activity. which in turn produces 
cholinergic rebound, the maxima 
effects of which do not occur until ] hour 
after termination of the stress. Moreover, 
following the initial efflux of catecholamines, 
there is a large diminution of catecholamine 
action. Thus, soon after shock a state of 
motor hyperactivity exists, which favors ac- 


1 Neurotransmitter 


per is that 


cholamine 
a central 
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tive avoidance and is incompatible with re- 
sponse suppression. At an intermediate in- 
terval the cholinergic dominance produces 
behavioral inhibition and consequently poor 
active avoidance performance. The involve- 
ment of the catecholamine and cholinergic 


1963; Kato, Gozsy, Roy, & Groh, 1967 : 
Welch & Welch, 1968, 1970). 


egrades these 
amines intracellularly, Support for such a 


contention has come from the finding that 
stress increases O-methylated derivatives of 
catecholamines with negligible alterations in 
their oxidatively deaminated catabolites 
(Bliss, Ailion, & Zwanziger, 1968 ; Carlsson, 
1966; Javoy, Thierry, Kety, & G 
1968; Thierry, Javoy, Glowinski, 
1968). With sufficiently intense Stress, on 
the other hand, reduced levels of nore- 
pinephrine have been noted (Bliss et al., 
1968; Bliss & Zwanziger, 1966; Maynert & 
Levi, 1964), 

Thierry (1973) has Suggested that w 
the inception of a relatively 
newly synthesized norepineph 
"functional Storage pool” 
Severe stress may lead 
amines stored in the 
as well, thus leading 
total norepinephrine. 
ing administration of 


lowinski, 
& Kety, 


ith 
mild Stress, 
rine in the 
is utilized, whereas 
to utilization of 
“main compartment” 
lo greater decline of 
In any event, follow- 


a strong stressor, de- 
cline in norepinephrine becomes More ac. 
centuated 2-4 hours after application of the 


stressor (Barchas & Freedman, 1963 . Stolk 


Conner, Levine, & Barchas, 1974. Stotie 


1970; see also Rosecrans, 1969) and has 
been known to require as long as 1-6 hours 
before the level of norepinephrine is replen- 
ished (Barchas & Freedman, 1963; Maynert 
& Levi, 1964; Paulsen & Hess, 1963). The 
Sustained reduction in norepinephrine levels 
could come about when neuronal activity, 
probably owing to activity of cells in the 
locus coeruleus (Korf, Aghajanian, & Roth, 
1973), increases to the extent that amine 
release and utilization exceeds total biosyn- 
thesis (cf. Moore & Lariviere, 1963; Rose- 
crans, 1969; Welch & Welch, 1970). In 
effect, then, with the inception of a severe 
stressor, norepinephrine is released from the 
primary storage granules in large amounts, 
which initially may lead to behavioral exci- 
tation. However, when norepinephrine utili- 
zation exceeds synthesis, the total norepi- 
nephrine available is diminished, conse- 
quently resulting in behavioral inhibition. 
Weiss et al, (in press) have indicated that 
not only are norepinephrine levels decreased 
in stressed animals, but also reuptake of nor- 
epinephrine is increased, thereby further re- 
ducing the amount of norepinephrine which 
might impinge on the Postsynaptic receptor, 

While stress clearly alters Norepinephrine 
synthesis and release, it appears that the ex- 
tent of Norepinephrine change may be modi- 
fied by the organism's ability to cope with 
the _Stressor, 55, Stone, and Harrell 
(1970) and Weiss, Pohorecky, Dorros, Wil- 
liams, Emmel, Whittlesey, and Case (Note 


b Yen animals can 
» norepinephrine levels 
hereas yoked control ani- 
á significant reduction in whole 
Drain norepinephrine as well as in hypo- 
thalamus, brain stem, and telencephalon. 

eiss et al. (in press) also indicated that 
when animals can cope with the stress, either 
or passive responding, nor- 
els may in fact increase, as 
Pical decrease of yoked 
ng these same lines. 
Observed that in rats 
k, Norepinephrine levels 
fter stress, relative to 
Sin nonshocked animals. 
ere shocked in pairs, thus 


n á {ion 
Norepinephrine reductio 


if rats w 


eliciting fighting 
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was not observed. Tt may well be that 
fighting in shocked rats served as a coping 
mechanism, thus preventing the decline in 
norepinephrine. It would appear that modu- 
lation of neurotransmitter levels is stimulus 
bound, but this relationship is dependent on 
the availability of coping mechanisms. 

The effects of stressors on acetylcholine 
activity parallel the modifications observed 
With catecholamines. Stress of electric 
Shock tends to elevate acetylcholine levels 
V^ Aprison & Hingtgen, 1970; Aprison, 
ua ge x ume Lay A es 
tna, ACE D rem 1 eese of norepineph- 
hu xlification is dependent on the 
availability of coping responses. For exam- 
ple, Karczmar et al. (1973) indicated that 
Stress of an inescapable nature produced in- 
creases in ACh of 25% and 846 in the 
telencephalon and midbrain, respectively. 
4 lowever, when equivalent amounts of shock 
dim receiveid in an escape/avoidance task, a 

crease in ACh is observed only in the 
telencephalon. i 
Mis Se acuario transmitter modi- 
Pres b nc availability of coping responses 
n onsistent with the findings regarding the 

-shaped avoidance function. "That is, the 
U-shaped function is observed following ex- 
DE n inescapable shock or following 
d id 5d avoidance training trials. dn 
ibo i t iese conditions animals have diff- 
fire: Es coping with stress, since in the 
i st case there is no availability of learning 
a pone Mi contingencies, and in the 
À d case the learning of these contingen- 
cies may require several more trials. Asa 
result, brain norepinephrine decreases and 
ACh increases. When the stress is of a 
controllable nature, norepinephrine increases 
and ACh decreases. The latter set of events 
lends itself to efficient response initiation 
and consequently high rates of avoidance 
performance. In the former case response 
initiation should be retarded and avoidance 
should proceed relatively slowly. Moreover, 
time parameters for the depletion and re- 
plenishment of norepinephrine are congruent 
With the temporal variations of avoidance 
behavior, 

»overman, Klaiber, 


In a recent paper, 
(19074) have for- 


Vogel, and Kobayashi 
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warded an argument that in many ways is 
similar to that presented here. These in- 
vestigators suggested that in short-term 
stress situations, release of epinephrine may 
stimulate central norepinephrine via the 
reticular activating system and the hypothal- 
amus (see also Gellhorn, 1957), conse- 
quently resulting in increased responsivity. 
However, following prolonged stress, nor- 
epinephrine may be replaced by an increase 
of false transmitters. Accordingly, a state 
of central cholinergic dominance prevails, 
with concomitant inhibition of ongoing be- 
haviors. The thesis of the present paper 
is not in disagreement with this hypothesis, 
but suggests further that the initial nor- 
epinephrine stimulation may result in com- 
pensatory increases in ACh. Accordingly, 
the net result may be one of greater im- 
balance of the catecholamine-cholinergic sys- 
tems in favor of cholinergic action. More- 
over, if decreased catecholamine-receptor 
stimulation occurs because of accumulated 
false transmitters, as suggested by Brover- 
man et al. (1974), a mechanism for in- 
creased. ACh levels can be postulated, since 
dopamine apparently has a tonic inhibitory 
effect on ACh levels (Bartholini et al., 1973). 
It is noteworthy that the present formula- 
tion does not exclude the involvement of 
other transmitter systems. Specifically, 
serotonin (5-HT) has been found to in- 
crease following exposure to inescapable 
shock but not following escapable shock (cf. 
Karczmar et al., 1973). The fact that 5-HT 
may be involved in response initiation 
(Aghajanian & Sheard, 1968; Carlton & 
Advokat, 1973; Lorens, 1973; Steranka & 
Barrett, 1974) and in the modulation of 
ACh and norepinephrine action (Swonger & 
Rech, 1972) raises the possibility that this 
neurotransmitter could also contribute to 
time-dependent changes in avoidance. 


Pharmacological Manipulations 

A number of studies clearly indicate that 
alteration in neuronal transmission effec- 
tively modifies the time-dependent changes 
in avoidance behavior. For example, as 
seen in Figure 2. scopolamine, which func- 
tionally blocks ACh action, improves avoid- 
ance performance, presumably by reducing 
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as a function of Scopolamine and physosti; 
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ij inhibitory tendencies (Bignami & Rosic, 


been noted to decline when animals were 
i 1971; Carlton, 1969). anq also eliminates 
Ü : 


treated with drugs leg., Physostigmine) 
that increase ACh (Bignami & Gatti, 1966 ; 
Following treatment 
Studies (Anisman & Kokkinidis, 1974; with Physostigmine, the U-shaped function 
Thompson & Neilson, 1972; Brett, Note 1); imi i ask (see Fig- 
Toreover, these investigators have demon- ure 2, right-hand panel), At the brief and 
Strated that central but not periphera] ACh intermediate intervals. the increase in ACh 
modification will eliminate 
performance at the intermediate 
Predictably, septal lesions, which 
cholinergic input to the hippocampus (Lewis, iff 
Shute, & Silver, 1967 ; Lynch, Mathews, 
Mosko, Parks, & Cotman, 1972: 
& Srebro, 1973), also have bee, 
effectively reduce the time-depe; 
ations in avoidance behavior ( 
Kokkinidis, 1974). This parti 
might Suggest that the Septo-hj 
axis is intricately involved in the U. 


ard Performance. Thus, the 
Performance otherwise seen 
between the brief and intermediate intervals 


are precluded. At the long retention inter- 
val, the shock-induced Ch changes are 
Consequently the disruptive 
YSOstigmine on avoidance þe- 
iminished. Alternatively. the 
ade that Within 24 “hours 
„Taming the Memory of the cue. 
ell consolidated, 
ance is less suscep- 
the Pharmacologic 


and as à result p 


to disruption by 
manipulation, 


t We turn to the 
predictable i 


Manner (see, for example, Grossman, 1964; 


Kelsey & Grossman, 1969), 
In contrast to the effects of 
gics, avoidance performance t 


* it will be remembered 
ats a U-shaped function 
ed in the One-way task, It is 
€vels are highest and , 
most pronounced at bi 


anticholiner- 
Ypically has 
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the Intermediate retention interval. How- 
po freezing behavior can be sup- 
ences in d an the one-way task, differ- 
manifested f y iral Suppression. were. not 
tration di n me avoidance task. Adminis- 
Saso in ? = sostigmine produced an in- 
Pila ACh, which was greater in the 

ir group than in the remaining groups. 
icles ‘Suppression: was increased 
sary for tog that several trials were neces- 
become des A at A response to 
differences blis yed. Under these conditions, 
the varin E Eesponse suppression among 
Way task a oa were detected in the one- 
the Tres ie important point here is that 
aclion was ent X Retention Interval inter- 
being e ee simply due to performance 
Bnd ian - vey nonmodifiable at the brief 
The vw a vi pee ig to ceiling effects. 
established! by function can be eliminated or 
fitit. »y appropriate pharmacological 

s. 

sape eR to the involvement of the 
Mat admin: — it has been observed 
Proves ee ration of d-amphetamine im- 
1973. Ü vidance behavior (Barrett et al., 

; Pignami, Robustelli, Janku, & Bovet, 


1965. Kre; : 
65; Kreickhaus, Miller, & Zimmerman, 


1965) , as 
hae and also eliminates the U-shaped 
ction (Anisman & Kokkinidis, 1974: 


Seinen a> Note 4). Administration of 
dis, 1974) ae ies. ow (Anisman & Kokkini- 
& Kokki; i mea le agents (Anisman 
1972. ee m 974; Thompson & Neilson, 
Shaped ^ zs ote 1) do not affect the U- 
effects of i eye clearly indicating that the 
anticholin «d catecholamine stimulant and 
level iem are mediated at a central 
that dim EUM. the available data, it seems 
féms pi catecholamine and cholinergic sys- 
hon y g AIAJON, role in determining re- 
lerenden choses as well as the time- 
observed changes in behavior typically 
d. 

onu rag from the shock-induced sup- 
Shock, it ` vis hyperactivity produced by 
having a appears likely that the latter be- 
ity (A s mediated by norepinephrine activ- 
Y (Anisman & Cygan, in press). When 
eia receive systemic injection of d-am- 
tamine (1-10 mg/kg in mice), general 
Activity typically is observed to increase. 


Following exposure to noncontingent foot 
shock the extent of the hyperactivity is 
greatly magnified. The excitation of be- 
havior appears to involve central catechol- 
amine activity, since the peripherally acting 
agent, p-hydroxyamphetamine, does not pro- 
duce postshock excitation. When animals 
are pretreated with an inhibitor of dopa- 
mine-B-hydroxylase, bis-(4-methyl-1-homo- 
piperazinylthiocarbonyl) disulfide, the en- 
zyme responsible for the conversion of dopa- 
mine to norepinephrine, general activity and 
amphetamine-induced locomotor activity are 
not affected. Apparently, the locomotor 
stimulant properties of d-amphetamine are 
due to dopamine rather than norepinephrine 
activity (see, for example, Carlsson, 1970; 
Fuxe & Ungerstedt, 1970). However, pre- 
treatment with the dopamine-B-hydroxylase 
inhibitor entirely eliminated the amphet- 
amine-shock synergism. Together with 
earlier reports indicating that shock potenti- 
ates the turnover of norepinephrine induced 
by d-amphetamine (Javoy et al, 1968), 
these data suggest that changes in nor- 
epinephrine turnover subserve the behavioral 
excitation produced by shock. 

It is noteworthy that the postshock exci- 
tation was not apparent with treatment of 
scopolamine, although the anticholinergic 
effectively reduced the suppression of activ- 
itv produced by shock. Apparently, the 
initial rise of activity is probably specific 
to effects on catechalomine activity. Thus, 
although balance between the catecholamin- 
ergic and cholinergic systems may exist, 
this balance may be one in which the cho- 
linergic system mediates response inhibition/ 
disinhibition, whereas the effects of cate- 
cholamine activity are in excess of simple 
response disinhibition. 


Associative Versus Nonassociative Effects 
of Neurotransmitter Modification 


It might be argued from the standpoint 
of a memory hypothesis that although neuro- 
transmitters are involved in the production 
of the U-shaped function, this involvement 
is not at the nonassociative level, but rather 
directly affects memory processes. How- 


ever, a memory interpretation for the U- 
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Shaped function is Supported neither by the 
data from a purely behavioral vantage nor 
by studies evaluating the Source of the phar- 
macological effects On avoidance behavior. 
To be more explicit, it has been posited 
(Klein, 1972; Klein & Spear, 1970a, 1970b, 
1973; Spear, 1973; Spear, Klein, & Riley, 
1971) that the U-shaped function is due to 
retrieval failure at an intermediate interval, 


owing to time-dependent Variations in en- 
dogenous and 


one, epinephrine, norepinephrine, acetylcho- 
ine, as well as other unnamed endogenous 
Substances). These changes produce state- 
dependent (drug dissociated) effects, such 
retrieve the origi- 


Organism's state 
during testing is Comparable to that which 


existed during original training. [t is Sup- 
Posed that soon after shock the endogenous 
chemical changes normally elicited by a 
Stress experience are not yet drastic, and 
consequently memory remains relatively 
Stable. One hour after the stress experience, 
the endogenous chemical changes are maxi- 
mal, and as a result memory retrieval is 
impaired, Within 24 hours, the internal 
State of the animal returns to a basal leve] 
and the organism again can retrieve ade. 
quately the previous learned response, 

The major source of support for the mem- 
ory hypothesis has been based on Severa] 
experiments Conducted by Klein and Spear 
(1970a, 1970b), which indicated that when 
animals were trained on an active avoidance 
task and tested on à passi idance task 
(or vice vers 
Served at a brief Or 
to naive animals, 
mediate interval, pe 
animals, 7 


sion reached by these investig 
at the intermediate ir 


unable to retrieve th 
Tesponse-shock cont 


nterval the Organism is 
€ previously established 


tees ingency, Precluding the 
Possibility of negative arising 


When the Previously 
is intact, as at the brief or lor 
negative transfer arises hee. 
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flicting response requirements of the two 
tasks, i 
Paradoxically, employing a somewhai 


similar type of test situation, several investi- 
gators (eg, Anisman & Waller, 1971; Bar- 
rett, Leith, & Ray, 1971b; Dauer, 1973; de 
oledo & Black, 1970; Steranka & Barrett, 
1973) have Observed substantially different 
results. These investigators shocked ani- 
mals in a compartment of a particular color, 
and Subsequently had independent groups of 
Tats run either toward or away from the 
preshock compartment at one of various an- 
tervals. In both the conditions in which 
subjects were required to run towards or 
away from the Preshock stimuli, a U-shaped 
performance function was Observed. Not 
Surprisingly, animals required to run away 
from those cues previously associated with 
shock acquired the avoidance response more 
Teadily than animals required to run to- 
ward the preshock stimuli, © 
portance here is the finding th 
ence was observed regardless 
retest interval, Moreover, the data pre- 
sented by de Toledo and Black 
Steranka and Barrett (1973) 
that performance 
was inferior to that observed 
mals. Jf memory could not h 
the intermedia 
Klein 


Í prime im- 
at this differ- 


at the intermediate interval 


€ retrieved at 
al, as suggested by 
1970), then it 
vhether animals 
or toward the 


have Mattered y 
Were required fo run away 
Preshock stimuli, and 


groups should have performed equally poorly 
at the Intermediate interval, In addition, 


Performance at intermediate interval 
Should not have differed from that of naive 


animals, 

In this same vein, Anisman (1973a) has 
Observed that if relatively intense shock (2 
MA) wa 


g initial training, 
an intermediate interval 
at of a naive baseline 
lisrupted, then 
intermediate interval 
lave equalled, 9r perhaps exceeded, 
naive animals, Together 
(197 Ed reported bv Barrett et al. 
d lb) ana Steranka and Barrett (1973). 
Avoidance, but not discrimination 


| 


= 
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oe aus found to vary over re- 
Bene one s, these results suggest that 
od rn itato abilities rather than re- 
ance denci ce are responsible for the avoid- 
fs i sev 7g at the intermediate interval. 

Bé airal o say that associative lactors 
of ithe Ua UBER AYE in the production 
Fes ^ Yaped function, but rather that 
can he Fespenun initiation appear to ac- 
ance’ than x greater portion ot the vari- 
abilities o potential changes in retrieval 
rbi suggested, concerning the 
caseron diem steroids, that since corti- 
Wl a Simia S vary following the inception 
by the erta he are differentially affected 
» bes : nt o control animals have over 
1971c) ihis (eg., W eiss, 197 1a, 197 1b, 
lepet dnt RRN might mediate the time- 
& Levine v in performance (Brush 
ws "de 6; Levine & Brush, 1967 ; 
Port of en aris, & Brush, 1971). In sup- 
B ocei d. ue Brush and Levine (1906) 
ditioning ES aer time following fear con- 
related s rony m corticosterone Cor- 
pletion of 3 avoidance behavior. The de- 
rupt oo is thought to dis- 
unable to ance, either because animals are 
etai bor. a subsequent stress or 
uds Dod ailure in memory retrieval 

a pasikon d However, contrary to such 
nd any ee studies have failed to 
Mines on a " pituitary-adrenal manipu- 
et al 19710. pee function (Barrett 
Ther io Kasper-Pandi, Hansing, & 
Subs: rete Snider, Marquis, Black, & 
Platenius is i Suboski, Marquis, Black, & 
ionship A. 70), suggesting that the rela- 
ie desine tween corticosterone level and 
Dore m avoidance performance at an 
i mu os api is not a causal one. It 
nephrine e that hypothalamic norepi- 
Cir eatrasie : tonic inhibitory role on the 
the dedi. ire casing factor. ] Thus, with 
iiredusses D epinephrine following the 
released (S ota stressor, corticosterone is 
osi, 1973) ccm Annunziato, & Prezi- 
release r^ er get the corticosterone 
fe degen e that of norepinephrine, would 
sty dent on whether the stress is one 
sn 1e animal can cope with. In any 
nt, the important point for the present 


in 
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purpose is that the U-shaped function is not 
causally linked to changes in adrenal steroids. 

If we turn to the effects of drug treat- 
ments on the modification of behavior, it 
seems unlikely that the pharmacological 
treatments that alter the U-shaped function 
alfect memory processes. For example, 

Jarrett et al. (1973) observed that although 

scopolamine and d-amphetamine improved 
avoidance performance in a Y-maze, there 
were no changes in discrimination learning 
as a result of the drug treatment. Evidently, 
the pharmacological treatment affects re- 
sponse-modulating (i.e., disinhibitory) sys- 
tems as opposed to memory processes. Em- 
ploying a somewhat different approach, Big- 
nami and his associates (Bignami, Amorico, 
Frontali, & Rosic, 1971; Bignami & Rosic, 
1971; Rosic & Bignami, 1970) reported 
data that have lent substantial credence to a 
response modulation hypothesis. These in- 
vestigators observed that in a go-no-go situ- 
ation, scopolamine increased errors of com- 
mission but decreased errors of omission. 
The drug treatment apparently decreased 
response inhibition, thereby increasing the 
probability of active responses being made 
and concomitantly diminishing the effective- 
ness of one component of the passive avoid- 
ance response. Results recently obtained in 
our laboratory (Anisman, in press) have 
replicated Barrett's earlier findings that sco- 
polamine and d-amphetamine improved ac- 
tive Y-maze avoidance but had negligible ef- 
fects on discrimination performance. Pre- 
dictably, both agents tended to disrupt in- 
hibitory avoidance behavior. Had the drug 
treatments altered memory, then facilitative 
effects should have been observed in active 
avoidance, inhibitory avoidance, and dis- 
crimination performance. Together, these 
experiments, as well as additional data re- 
viewed by Bignami (in press) and Carlton 
and Markiewicz (1971), indicate that the 
actions of scopolamine and d-amphetamine 
are task specific in that they affect inhibitory 
tendencies. 

The notion that scopolamine and d-am- 
phetamine selectively modulate tasks involv- 
ing inhibitory tendencies has been verified in 
a series of experiments designed to examine 
just such a possibility. Specifically, Suits 


374 


and Isaacson (1968; see also Anisman, 
1973a) reported that the facilitative effects 
of the drug treatments were apparent in a 
shuttle task but not in a one-way task. In 
the shuttle task, where freezing is relatively 
high, scopolamine improves performance 
through the reduction of freezing behavior, 
In contrast, when freezing ordinarily can be 
Suppressed readily, as is the case in the one- 
way task, beneficial effects elicited by further 
elimination of response suppression are 
negligible. In a similar vein, Oliverio 
(1967) has reported that the beneficial ef- 
fects of scopolamine are evident early in 
training, when freezing presumably is rela. 
tively high, but not later in training, when 
freezing has been Suppressed to some extent, 
Finally, data Teported by Anisman (1975; 
for complete reviews of relevant literature, 
see Dignami, in press; Bignami & Rosic, 
1971 ; Carlton, 1969; Carlton & Markiewicz, 
1971; Longo, 1966) also indicated that if 
the task is very difficult, the beneficial effects 
of scopolamine and amphetamine may not be 
apparent. It seems that the presence of a 
associative component may 
override the Tesponse-modulating effects of 


In effect, 


the mere 
increase in response-initiating 


and avoid- 


ance is limited py response inhibition, then 
the pharmacological treatments may improve 


Appear to alter associ 
While the basi 
paper is that no 


; nassociatiye processes are 
mediated py 


, eurochemica] vents, it is 
necessary to interject several caveats, In 
particular, the argument may be advanced 
that although noncontingent Shock is fol- 
lowed by behavioral and neurochemical 
changes, the relationship between these 
events is simply correlational and does not 
imply causality. The stress situation could 
set in „Motion unknown mechanisms that 
result in response repertoire Modification 
and quite independent], produce 


| neuro- 
chemical changes. Alternatively 


the re- 
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; 
sponse repertoire changes themselves ps 
elicit neurochemical modification. Tiea 
that pharmacological manipulations pe 
predictable changes in behavior pd 
against such positions and suggests a ken 
relationship between neurochemical eo 
and nonassociative processes. Neverthe en- 
at this time it is premature to dismiss i 
tirely the possibility that associative et 
esses or response-repertoire changes in 
€nce neurochemical events. zer AH 

Second caveat, discussed carlier Ww ; 
respect to time-dependent changes in M 
formance, needs to be reiterated ere t- 
terms of the effects of pharmacological aie’ 
ments, Specifically, although motor va 
hancement or response disinhibition favor 
improved avoidance performance, pharma 
cological treatments that eliminate these non- 
associative factors do not insure high levels), 
of avoidance, In the presence of adequate 
associative Processes, treatments such 
i ce performance it ~ 
he limiting factor fc 
In some tasks the re- 3j 


i 


Sponse suppression is t 
high avoidance rate, 


Sponse enhancing or disinhibitory qualities 
of a drug may disrupt rather than augment i 
performance 


- For example, 
are available indicating 
may disrupt one-way $ 

ance (cf. Bignami et 
tradictions 


several reports 
that opolamine | 
" and pole-climb avoid- 
aL, 1965). These con- 
apparently are a consequence of - 
the T 6Sponse-repertoire change induced by | 
the drug, together with the response requisite 
of the avoidance task. In the one-way task,” 
incompatible responses established by shock 
are suppressed quickly, Thus, an increase 
™ exploration produced by scopolamine may 
Compete with effective defensive behaviors ' 
that occur otherwise, Similarly, in the case 
od pole climbing, the increase of motor ac- 
tivity may well pe incompatible with the | 
climbing/grasping behavior necessary d A 
efficient avoidance, Ti seenis that in order | 


: ü rugs, it 
to analyze the behavioral effects of drugs, 
'S not sufficien 


nases created 


t simply to view the response 
by the pharmacological vei 
ments, but rather the conflicts between these 
Diases together with the various components 
of the avoidance task should be evaluated. | 
Indeed, Precisely such formulations yield 
additional important evidence regarding the 


‘| 


|] 


q of behavior 
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tic, Coping behavior and response-reper- 
following p s the modulation of behavior 
passive cont Ir treatment In the case of 
sible, E tance, coping behavior is pos- 
gélicies i ar ees response-shock contin- 
shouid er z Ithough acetylcholine levels 
ase ue do under these conditions 
Bud aiio. [ iscus ion), scopolamine effec- 
disrupts ae inhibitory tendencies and 
1971). ee mance (cf. Bignami & Rosic, 
dependent 3 y, the effects of the drugs are 
is s do response styles rather than 
Fin ts ates of coping behavior per se. 
i the ae additional limiting condition 
and its relati M an snil Er modification 
be e b up to performance should 
Scone eee W hen considering the inter- 
mitter chane E coping behavior, neurotrans- 
necessary “i and response requisites, it is 
y to ig a that any predictions 
Ei m are dependent on species and 
on ie In addition to Cee 
eren t suppression, various strains 
to drug LUN found to differ in response 
foo e i and the interaction be- 
üt Croan 4 Satiments and shock (Anisman 
" Similarly gui ess; Anisman et al., in press). 
» exit ranig of mice have been found 
aid mins Er e levels of catecholamines 
ciated with : mE as well as enzymes asso- 
metet, & M; ej transmitters ( Ebel, Her- 
& Calhoun DA 1973; Pryor, Schlesinger, 
variations j^ 56). Given the existing strain 
in oie Mie oscar au activity and 
Prising to Pu SUES, it should not be sur- 
Variations du strain specific temporal 
shock 4. the nonassociative effects of 
diferent n 1973a). Similarly, since 
ashibe oe of rats have been found to 
di anal dep response styles (in terms 
contineat dite, rare. following non- 
that EA E shock, it is predictable 
sponding w wr less anticipatory re- 
Re te eim d exhibit poorer active avoid- 
Paco "hien ae & Mills, 1974) and 
longed imas ion of performance after pro- 
avoldable s ire to signaled inescapable/un- 
i able shock (Shurman & Katzev, 1975). 
T ipei strain differences may con- 
Ns ead greatly to the effects of shock on 
ter ance via inherent differences in 
ms of their response to aversive stimula- 
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tion. At the same time a genetic/biochemi- 
cal approach in the evaluation of behavior 
phenomena may be useful in drawing frac- 
ture lines between the roles of response con- 
tingencies, coping style, and neurotransmit- 
ter activity in the modulation of perform- 
ance. That is, it should be possible to eval- 
uate the effects of a particular treatment on 
strains exhibiting similar response styles in 
the face of contingent or noncontingent 
stress but differing in neurochemical activity. 
Under these conditions, or permutations of 
these conditions, it may be possible to eval- 
uate the role of a variety of factors, at both 
an associative and nonassociative level, in 
determining ultimate response rates (cf. 
Anisman, 1975, in press; Anisman & Cygan, 
in press; Anisman et al., in press). While 
at present there appears to be a central core 
of correlates of nonassociative factors, such 
as variations in neurotransmitter activity 
and stress-induced excitation and inhibition 
of behavior, this definition must be con- 
sidered provisional, pending evaluation of 
other potential concomitants of contingent 
stress, for example, coping by anticipatory 
responding (Katzev & Mills, 1974) and 
pseudoconditioning (Izquierdo, 1974). 


Extensions and Implications 


The model outlined in the previous sec- 
tions is applicable to other stress-related phe- 
nomena. Some of these are presented be- 
low; additional pertinent implications are 
available in Gellhorn (1957). 

One particularly fascinating phenomenon 
that has received considerable attention over 
the past few years is the finding that animals 
subjected to inescapable/unavoidable aver- 
sive stimulation subsequently exhibit im- 
poverished escape/avoidance performance 
(cf. Anisman & Waller, 1972, 1973: Maier, 
Seligman, & Solomon, 1969; Seligman, 
Maier, & Solomon, 1971). Seligman and 
his associates have attributed the disruptive 
influence of the initial stress to “learned 
helplessness.” Specifically, in the face of 
inescapable unavoidable stress, animals learn 
that their responses are independent of shock 
offset. Accordingly, in subsequent encoun- 
ters with similar stressors, the organism 
neither attempts to escape nor to avoid the 
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aversive stimuli. In support of this hypoth- 
esis it was observed that if animals were 
first trained in an avoidance task and then 
subjected to the inescapable shock procedure, 
subsequent avoidance performance was not 
disrupted. In effect, the suggestion by these 
investigators is that if animals learn that 
control is possible, helplessness will not re- 
sult from the inescapable shock procedure. 
As an alternative to such a model, several 
investigators have suggested that the dis- 
ruptive effects of inescapable shock are due 
to either associative (Weiss, Kreickhaus, & 
Conte, 1968) or nonassociative (Anisman & 
Waller, 1971, 1972, 1973b) competing re- 
sponses elicited by the initial stress experi- 
ence. Moreover, Weiss et al. (1970) have 
suggested that the freezing response may be 
mediated by decreased levels of norepineph- 
rine, owing to the severity of the shock pro- 
cedure used, together with the lack of coping 
responses available to the animal (see pre- 
vious discussion). When animals have some 
control over the stressor, then norepineph- 
rine levels are not greatly modified, and con- 
sequently disruption in performance is not 
observed. Clearly, the two hypotheses are 
quite similar; the latter assuming that ulti- 
mately, nonassociative effects mediated by 
neurochemical changes are responsible for 
the disruptive effects of inescapable shock, 
while the helplessness hypothesis attributes 
the inescapable shock effects to cognitive fac- 
tors without discarding biochemical models, 


ctions follow from 
the neurochemica] model : 


capable 
energic 
of shock, 


ould pro- 
ges in behavior follow- 


action or throug! 
then modification of ACh levels sh 
mote predictable chan 
ing inescapable shock, 

3. If the disruptive effects of the inescap- 
able shock procedure are mediated primarily 
by neurochemical modification and not as- 
Sociative processes per se (iie, learning that 
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shock offset is independent of responses); 
then it follows that disruption in pentan 
ance should be modifiable through pus 
cologic treatments, without resort to ae if 
escapable shock procedure. lapsi oi v 
disruption in performance under such cor ht 
tions is observed, then these effects pes s 
to be subject. to modification. in the pe 
wav as the inescapable shock effects dd 
modifiable, For example, prior peer oe 
training should immunize the animal agains 
drug-induced disruption of performance. ! 
Much of the currently available data Bue 
in accord with these predictions. For ex- 
ample, it was observed that d-amphetamine 
(Anisman & Kokkinidis, 1974) or scopol- 
amine (Anisman, 1973a) eliminated the 
disruptive effects of inescapable shock, 
whereas increasing ACh levels through 
treatment with physostigmine exacerbated 
the disruptive effects of the inescapable 
shock procedure (Anisman, 1973a). Simi- 
larly, Thomas and Balter (in press) found 
that direct application of an anticholinergic 
to the septal area alleviated the disruption of 
avoidance induced by the initial str 88 pro- 
cedure. Clearly, the effects of inescapable 
shock are modifiable yia pharmacological 
Manipulations, Of prime importance, how- 
ever, is that both central catecholamine or 
cholinergic manipulations produce similar 
effects, thus implicating both of these sys- 
tems in the involvement of the disruptive 
influences of inescapable shock, Inasmuch 
as the pharmacological agents employed do 
not have profound influences on associative 
Processes (vis-à-vis earlier discussion), it is 
likely that nonassociative factors mediate the 
poor performance followi 
shock procedure, 
With respect to th 
ments that increase ACh or 


cholamine levels in the 
procedure, the 


with caution, 


ng the inescapable 
€ effects of drug treat- 
decrease cate- 
absence of a shock 
available data must be treated 
since with higher dosages pro- 
Srüption of motorie abilities are 
produced, ‘cordingly, only those studies 
in which Io i x 


comotor disruption is not appar- 
plicable t 


M ; © the present analysis. 
h oreover, In order that a drug treatment 
be Pertinent to the current discussion, the 
drug 


must be capable of mimicking the ef- 


the 


A 
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acm: observed in the inescapable 
borne 5 ely With these stipulations 
panies Pg there are several reports 
eseapable e € that the effects of in- 
uS vits om may be mimicked simply 
anie poem ogical treatments. For ex- 
add n =, et al. (in press) reported that 
Dee en i erac levels with tetra- 
Klandly omis pas behavior which resembled 
E nsi de inescapable shock treatments, 
difficult s m was observed in a fairly 
per task uttle task but not m a sim- 
sponse. Siniiizi, Pi & mosëpressing ae 
(1974) v ies a y. Fibiger, Phillips, and Zis 
(6-OHDA) dem that 6-hydroxydopanimne 
sith wipes gu of the substantia nigra, 
ee 2 t norepinephrine and dopamine 
active ia sspe the acquisition of an 
metn E pen response. However, if the 
Sstuption p jeen established previously, no 
the 6-OH EN formance was produced by 
Gn P treatment. 

mL ee ud e data regarding de- 
oliscevedl niet : - rolamines, it has been 
levels nein ently that increasing ACh 
(cf. Bienen a car 1968 pi pope 
1975)." As E = 1966; Bignami et al., 
iw, thes M 1e inescapable shock par- 

sruption of shuttle avoidance 


with incre 
bare uae levels of ACh was sub- 
N ally greater than that seen in a one- 
a av PT : - 
y avoidance task (Anisman, 1973a). 


E 
enlace d it seems to be the case that 
ite itio o € that increase ACh levels 
DN ad Nw ape in animals that show low 
ada eth s i of performance than in ani- 
Staton Toe and more stable baselines 
1971) dr vi a; Rosic, Bignami, & Gatti, 
Nét en 4 ess potent on already established 
Hioosses ap than un we e 
al, 1978). . Anisman, 1973a; Bignant et 
ot — M es it seems that the effects 
ae : de shock on avoidance can be 
dolane * —— that decrease cate- 
eeen e s or increase ACh levels, 
isque h bat the disruptive effects of in- 
Hb ios Shock may be mediated. by vari- 
qp eurotransmitter levels. This is not 
Selge rat helplessness. as suggested by 
e S a viable position. Clearly. 
ansmitter alterations are dependent 


on whether or not animals can cope with 
the stressor. Rather, the suggestion here is 
that the invocation of a cognitive model, at 
least in the animal subject, is superfluous to 
the understanding of the disruptive effects 
of inescapable shock. 

Recently, Seligman and his associates 
(Miller & Seligman, 1973; Seligman, 1974; 
Seligman, Klein, & Miller, in press) sug- 
gested that the helplessness notion might 
serve as a useful analogue for reactive de- 
pression in humans, in that many of the 
symptomatologies appear similar. "The haz- 
ards inherent in such generalizations are 
well known. Nevertheless, the suggestion is 
a tantalizing one. Inasmuch as depression 
is typically modified with treatments that 
increase catecholamine activity (cf. Schild- 
krau, 1965), as well as those treatments that 
decrease cholinergic activity (Janowsky et 
al., 1972; see also Lloyd & Hornykiewicz, 
1974, for review of this literature), it is 
likely that these neurotransmitters are in- 
volved in the mediation of depressive be- 


haviors. Whether or not the biochemical 
changes are in fact due to uncontrollable 


stresses is not clear. However. if a model 
of depression involving the helplessness no- 
tion is to be employed, it would be advan- 
tageous to include the neurotransmitter no- 
and especially that of balanced trans- 
into these formulations. 
also is amenable to other 
as those 


tion, 
mitter systems, 
Such a position 
currently existing models, such 
which suggest that genetic influences may be 
involved in predisposing individuals to de- 
pression (Akiskal & McKinney, 1973; 
Winokur, 1973). Thus, one might specu- 
late that inherited biochemical levels is the 
underlying phenotype regarding the be- 
havioral response to various stressors. 
Again, a cautionary note needs to be inter- 
jected here, namely, although pharmacologi- 
cal intervention produces beneficial results 
in depressive behaviors. suggesting à causal 
link between chemical levels and the be- 
the possibility still exists that the 


havior, 
may come under 


neurotransmitter activity 

cognitive control or is affected by cognitive 
Moreover, in dealing with stress 

nical relations it is necessary to define 

ate coping in differ- 


state. 
biocher 
what represents inadequ 
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ent individuals, and other things being 
equal, how do different individuals react, n 
terms of biochemical changes, to a standar 
difficulty in coping behavior. 

Finally, another stress-related occurrence 
in which rebound effects may play a major 
role is that of gastric ulceration, Specifi- 
cally, Weiss (1971a, 1971b, 1971c) has ob- 
served that in those stress situations in which 
feedback is unavailable, animals do not have 
methods for coping with stress, and when 
shock is made contingent on appropriate 
coping responses, large amounts of ulcer- 
ation typically are observed. Each of these 
Situations has been noted to be associated 
with low levels of norepinephrine. 
much as ulceration typically occurs only 
after several hours following stress termina- 
tion (Desiderato, Mackinnon, & Hissom, 
1974), it is likely that the initial stress- 
induced norepinephrine reaction is countered 
by both central and peripheral cholinergic 
action. The parasympathetic rebound would 
be expected to result in vagal innervation 
and hydrochloric acid secretion, thereby pro- 
moting ulceration. This notion is clearly 
speculative, however, and requires further 
evaluation. 

While the present paper has suggested 
that rebound effects may be responsible for 
several stress-related phenomena 
be clear that such a notion may be amenable 
to other clinical phenomena. For example, 
Gellhorn (1957) has Suggested that phe- 
nomena such as functional psychosis, emo- 
tional responsivity, and hormonal regulation 
might involve a rebound type of model. 
Similarly, it is likely that other paradoxical 
and poorly understood phenomena, Such as 
the enigmatic effects of Sympathomimetics in 
hyperkinetic children or dual effects of lith- 
ium in manic and depressive States, involve 
the innervation of antagonistic Systems fol- 
lowing excessive stimulation, 


Inas- 


, it should 


Concluding Remarks and Summary 


While associative factors cle 
rate-limiting step in the acquis, 
sively motivated behaviors, the 
the present paper is that nonas, 
tors similarly determine avoid 
rate. Needless to say, in t 


arly act as a 
ition of aver- 
Suggestion of 
Sociative fac- 
ance response 
1€ absence of 
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associative abilities, reduction of ripen 
ble nonassociative factors will not pem. 
learning. However, in the presence pc 
associative abilities, nonassociative effe T 
elicited by a stressor may hinder T eui 
ance. When these nonassociative ipee, in 
eliminated, the observed response rate 15 4 
truer indication of learned behaviors. . 
Within aversively motivated Bata En. 
nonassociative factors are largely mediates 
by variation in neurotransmitter activity- 
Reduction in norepinephrine and/or dopa- 
mine or increases in acetylcholine result in 
increased response inhibition, which aug- 
ments passive avoidance but disrupts active 


responding. Modification of transmitter 
levels should effectively alter performance 
level 


if poor performance is a result of non- 
associative factors. Although behaviors may 
be altered via manipulations of 
transmitter, it is likely that several trans- 
mitters act jointly to determine the ultimate 
response rate. Basically, the suggestion was 
made that catecholamines and acetylcholine 
act in a balanced antagonistic manner, 
thereby maintaining neurochemical home- 
ostasis. However, when the balance moves 
in favor of cholinergic dominance, active re- 
sponding is inhibited in aversively motivated 
paradigms, 7 


In addition to the existence of balanced 
transmitter systems, it is likely that with 
excessive stimulation of one system, an an- 
tagonistic rebound May occur in a second 
System in order to maintain neurochemical 
stability, Paradoxical behavioral effects may 
occur, however, since the rebound may be a 
sluggish one in terms of onset and termina- 
tion, while the initial transmitter change 
elicited by direct stimulation may be rapid in 
onset and transient in duration. The inclu- 
Sion of such a model adequately explains 
several time-dependent phenomena associ- 
ated with aversive stimulation, as well as 
phenomena dealing with traumatic stresses 
and factors dealing with control over stress. 


Finally, it can be applied as a model to 
clinically related behaviors, 


a particular 
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LETTERS TO 


Neural Basis of Shape Constancy 


Milner's article on visual shape recognition 
is valuable for spelling out plausible possibili- 
ties of feature analyzing circuits that are quite 
amenable to experimental test (Milner, 1974). 
However, his model does not cover the im- 
portant question, less accessible to present 
physiological techniques, of the Spatial rel 
tions of the features internal to a pattern. 

Although certain details must be discarded 
to achieve perceptual constancy, these feature 
analyzers lose critical information, For ex- 
ample, the distance ratio cell preserves infor- 
mation about the ratio but not about which line 
is which. It is not clear whether the mechanism 
that restores information about locus, for pur- 
poses of directing responses, also works for 
internal structure, If the net effect 
nonsynchronized firing of cells 
different parts of a display is simply to con- 
vert a question of spatial organization into 
one of temporal organization, then the basic 
problem still is not solved, . 

It seems doubtful that feature analysis 
Sensory systems continues on 
toward single, "pontifical" 
for every possible experienc 
lock, 1968, pp. 284-286), 
levels of analysis might 
spatial stimulus information into some sort of 
organized pattern of multineuronal interaction 
(e.g., Pribram, 1971), aided by feature analysis 
Properties of single neurons, 

Brain activity is visible through amplification 
channels only at a few points at a time. Until 
some fundamental advance in instrumentation 
occurs, we need inferential tools that 
to recognize possible organized 
neural networks, 


as 


of the 
that perceive 
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at higher levels 
neurons, prepared 
e (Perkel & Bul- 
Alternatively, higher 
involve mapping of 


allow us 
activity in 
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Reply by Milner 


Glassman is correct in pointing out that the 
attention mechanism I described functions only 
for whole objects and cannot explain how, for 
example, one might focus attention on the top 
left-hand corner of a figure, or the brushwork 
on the grass in the foreground of a painting- 
I think the answer to this must lie in the com- 
plexity of the available concepts, 
abstract Concepts such as “top,” 
“middle,” and so on. If sever 
could be activated simultane: 
result in levels of neural 
certain parts of the 
and a consequent pr 
there for controlling 

It is possible that I 
that the features resul 
described in my 


especially 
"second," 
al such concepts 
ously, it would 
activity higher in 
visual field than in others 
referential use of neurons 
responses, 

did not adequately stress 
ting from the processing 
paper Converge again at a 
concept (or "cell-assembly" ) level, Glassman 
Apparently considers it necessary to emphasize 
that relationship with his point regarding the 
unlikelihood of “pontifical” neurons. l] en- 
tirely agree with theories of multineuronal 
Tepresentation of Concepts, but I am not too 
Optimistic that additional “inferential tools" 
can be substituted for improved technology. 
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The Endogenous-Exogenous Partition in Attribution Theory 


Arie W. Kruglanski 
Tel-Aviv University 
Tel-Aviv, Israel 


several significant inferences are as- 


Within lay explanation of actions, 
between endogenous and exogenous 


sumed to follow from the partition 
attributions. An action is said to be endogenously attributed when it is 
judged to constitute an end in itself and exogenously attributed when it is 
judged to serve as a means to some further end. A theoretical framework 
is developed whereby the action’s endogenous attribution is linked with the 


inferences of intrinsic motivation, subjective freedom, and the action’s under- 


lying intention. The endogenous-exogenous distinction is proposed to re- 
place the frequently invoked partition. between the action's internal and 
external causes. Both conceptual and empirical considerations are put forth 
in favor of such a replacement. Classical attribution topics to which the 
internal-external partition has been applied are reinterpreted in terms of the 
endogenous-exogenous distinction, and novel data are reported that support 
the latter framework. Finally, several categories of conditions for endoge- 
nous (or exogenous) attributions are identified, and possible directions of 
further research within the endogenous-exogenous framework are suggested. 


res & Davis, 1965; as basic to attribution processes (Kelley, 
Nisbett, Valins, & 1967, p. 194) in the sense that the designa- 
7. ]971a, 1971b, tion of a cause as internal has been assumed 


ation of effects to identify it meaningfully so that explana- 


causes. Fre- tion of the effect would readily follow. In 
ave been particular the internal-external framework 
was assume 


Attribution theory (Jo! 
inen Kanouse, Kelley, 
Veiner, 1972; Kelley, 196 
1973) deals with the explan 
via the identification of their 
quently the causes of behaviors h 


partitioned into those internal to the person d to yield inferences as diverse 
and those external to him (internal causes aS those of (a) intrinsic. motivation (see, 
have been alternatively labeled as personal | €. deCharms, 1968; Deci, 1971). (b) sub- 
and external as environ The in- jective freedom (Kelley, 1967: Kruglanski 
ternal-external division regarded & Cohen, 1973). (c) perceived. pain (Nis- 
Ree | bett & Valins, 1971). (D positive attitudes 
I am indebted to Bonnie Erickson, cem Teka (Bem, 1972), amd (Ce) gincEfa reporting 
i ed Jones, Hal Kelley, Moshe Kroy, Andrea “ii (Bem 1972; Jones & Davis, 1965). 
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endogenous-exogenous distinction affords all 
of the inferences accredited. unjustifiably to 
the internal-external partition between the 
action’s causes. To that end classical at- 
tribution experiments that have relied on 
the internal-external framework are re- 
viewed and reinterpreted, and in some cases 
novel data are reported that support the 
endogenous-exogenous distinction. But be. 
fore a discussion is undertaken of the 
specific explanatory properties of the endo- 
genous-exogenous framework, it IS neces- 
sary to consider the more general problem 
of explanation through attribution. This is 
attempted in what follows. 


Attributional Explanation 


The lay person’s inference Process is as- 
sumed to comprise two separate steps, causal 
assignment and causal explanation, The first 
Step has been analyzed closely by the attri- 
bution theorists (in particular by Kelley, 
1967, 1971a, 1971b, 1973), During this 
stage the attributor aims at identifying the 
cause of an effect via analysis of i 
i of hypothesis testing (cf, 
Kelley, 1967), However, little attention has 
been paid to the second 
the stage of causal explanation. This is the 
problem of drawing sor 
the effect once its cau 
fied. More specifically, attributional expla- 
nation is assumed to occur when the effect's 
determinant has been identified, such that 
Certain inferences follow beyond those con- 
ion as such. 
th an effect, ș 
Several possible determin 
“assignment sta 


9 determine whi. 
may be most plausibly yj 


established, 
at is, certain 
may 
classified into two levels, 
functional and the derivati 
functional leye] 
Eaker, 1972) the 
tionally related to some “¢ 
(X), characterized in empirica 
fional (ag Opposed to theory, 


be Convenient] y, 


ui 


For example, in the explanatory "-— R 
"John handed over his wallet because v 

had asked him to do so," Paul's. asking 
would constitute a functional explainer, pe 
it designates an empirical event devoid es 
conceptual content. The information eae 
that accrues from deciding that the effec 
has been functionally related to some em- 
Pirically defined determinant is confined i. 
the assertion of covariance between the two. 
Speaking more precisely, everything else 
being constant, X (the determinant) has 
been identified as the necessary as well as 
the sufficient condition of b (the effect). 
Thus, the elimination of X will do away 
with b, and the reintroduction of X will re- 
produce b as well, In terms of our example, 
all that may be said is that in the specific 
circumstances Paul's asking (for the wallet) 


would suffice and be necessary to ensure 
John's compliance, 


A derivative-leve] expl 
à functional relation, 
minant is now characterized 
Opposed to empirically 
tional level), For instance John's yi 
of the wallet might be explained by refer- 
ence to the appropriate request coming from 
an armed bandit. The difference between 
an empirical and a theoretica] characteriza- 
tion of an explainer is that the latter affords 
derivations that transcend the assertion of 
covariance between the determinant and the 
effect, while the former's implications are 
exhausted with the covariance assertion, In 
our example, the lay-theoretic concept, 
armed-bandit, affords “several informative 
derivations, for example, that John was ex- 
Periencing fright while yielding the wallet, 
that harm might have befallen him had he 
refused, or that most people in his place 
would have als To state this dif- 
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also complied. 
lerivative explanatory level 
inant X carries with it a 
> of which b is but one 
Y» b. c, d, and e, while the deter- 
. i i arry a different set of im- 
Plications Containing p, for example, b, e, f, 
Thus, by identifying b's antece- 
one may derive c, d, and Y 
A identifying b’s antecedent as 
E g, and h will follow. 


Generally, 
explanation as the extent 
tial gain has been made (or as the size of 
the implicational set) that accrues from 
characterizing the antecedent of an effect in 
a particular way. It follows that explana- 
tion at the functional level is weaker than 
one at the derivative level, and within the 
latter level the degree of explanation will 
vary with the number of derivations com- 
prised in (ie. the theoretical content of) 
the antecedent term of the explanatory 
proposition. 

The notions above facilitate the analysis of 
the explanatory properties of the endoge- 
nous-exogenous partition. But first let us 
lay out the rationale for preferring (in some 
domains of explanation) the latter classifica- 
tion over the current alternative, the distinc- 
tion between internal and external causes. 


one may define the degree of 
to which inferen- 


Actions Versus Occurrences 


It is useful to classify the various effects 
to which attributional analyses may be ap- 
plied into two major categories: occurrences 
and actions, The fundamental distinction 
between these two is that actions are volun- 
tary while occurrences are not. In other 
words, an action is commonly assumed to 
be determined by the actor's will, while an 
occurence is some significant degree inde- 
pendent of the will. For instance, effects 
Such as "running," “stepping on a scale," 
or "uttering a sound" would qualify as ac- 
tions, because persons may bring them about 
simply by deciding to do so. By contrast, 
"winning à race," “weighing 156 pounds," 
or "being heard" would represent occur- 
rences, as would “running a fever,” “feeling 
pleased,” or “experiencing an acute anx- 
iety," Note that with the former three oc- 
currences some action was presupposed (e.g. 
winning a race presupposses running it). 
while none was presupposed with the latter 
three, 

But a common feature of occurrences as 
a class 18 that one may not engender them 
mere willing: The will to succeed may 
not guarantee success, nOT, alas, may the 
will to weigh 105 pounds guarantee the ap- 

le reading. Now, it is the oc- 


propriate sca . 
* (relative) independence of the 


by 


currences 
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will that renders them explicable in terms 
of stable or transient attributes. For exam- 
ple. the occurrence, “P's success on a task,” 
may be explained by this person's attributes 
of competence or diligence. while the occur- 
rence, "O's experience of fright,” may be 
explained in terms of the situational attri- 
bute of darkness. As the foregoing exam- 
ples illustrate, both persons and environmen- 
tal entities have attributes; therefore, in ex- 
plaining occurrences it seems justifiable to 
distinguish between personal (internal) and 
environmental (external) causality. Indeed, 
Weiner, Frieze, Kukla, Reed, Rest, and 
Rosenbaum (1971) made extensive use of 
the internal-external distinction in their at- 
iributional analyses of success and failure 
occurrences, But for actions (as opposed 
to occurrences) the internal-external parti- 
tion appears less justifiable: All actions are 
commonly assumed to be determined by the 
will, and the will is internal to the person; 
hence, all actions are determined internally. 
This conclusion serves as 2 departure point 
for an attributional theory of actions de- 
veloped in the remainder of this paper. 


ExpoGENOUS-ÉNOGENOUS DISTINCTION IN 
NarvE EXPLANATION OF ACTIONS 


In the present section a theory is devel- 
oped about the inferences that may follow 
from causally attributing actions in two con- 
trasting ways, designated here as endoge- 
nous and exogenous. The theory will sub- 
sequently be applied to diverse attributional 
findings and will be contrasted with alterna- 
tive interpretations based on the internal- 


external framework. 


PosTULATE 1. For the naive attributor, 
actions are explicable in terms of reasons or 
purposes. That is, the lay explanation of 
actions is teleological rather than causal. 
Derivation 1 (from Postulate 1). [n- 
sofar as all actions are (assumed) purposive, 
any action is either its awn end or is @ means 
] some further end. Now it 
that the means-goal catego- 
lly distinct and mutually 


to attaining 
should be clear 
ries are concepttia 
exclusive: The sense 
classified a means is 
nongoal. This is not to say that for any ac- 


in which something is 
one whereby it is a 
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tivity its means versus end status may be 
clearly ascertained. An activity may be 
means relative to Goal X and an end relative 
to Goal Y. For example, “riding a bicycle 
to school” may constitute a means of trans- 
portation, and at the same time an end of 
physical exercise. Of course, the above ex- 
ample does not imply that the concepts of 
means and end are nondistinct, any more 
than the concepts large and small are in the 
example of $1,000 being considered at the 
same time as large relative to the price of 
an ice-cream cone and small relative to the 
price of a mansion. 


DEFINITION 1, Let the term endogenous 
attribution of an action denote the case 
wherein an action is attributed to itself as 
a reason. In other words, an endogenously 
attributed action is an end in itself. 

DEFINITION 2. Let the term exogenous at- 
tribution of an action denote the case where 
the action is not an end in itself but rather 
is a means that mediates a further goal, one 
exogenous to it. 


PosruLATE 2. All purposes or intentions 
are internal to the individual, A goal or an 
intention is internal in that it presupposes 
the person. Some intentions may arise from 
man's biological nature (hunger, thirst), 
others (achievement, altruism) may arise 
from internalized social norms, 

DERIVATION 2 (from Postul 
different goals are not divisibl, 
ternal and external categories. The latter 


would simply be a null Set. It is of interest 
to note that the use of the i 


ate 2). The 
e into the in- 


ings has typically lacked e 
Rather, i 


an arbitrary 
terize some Purposes in a * 
(that has an "interna]" 
§ Purposes in a "goal" 
(that feels “external”, ‘or instance 
the attributional analyses of findir 
motivation (eg., Deci, 1971) d 
terest in the task has been fi 
ferred to as the interna] Cause of task per- 
formance and the contingent Monetary re. 
ward as the external Cause, But one could 
readily reverse the labels and call the task 


anguage 
nd other 
Or object language 
in 
ngs On task 
ae actor’s in- 
requently re- 
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external and the desire for money aie 
Obviously, to call a purpose “money in 
"desire for money" hardly alters the et 
that one is dealing with precisely the san 

urpose. . ; 

» Corollary to Derivation 2. The on 
of the internal-external dimension in P 
case of actions simultaneously denies 
possibility of any other variables being sys a 
matically related thereto. Thus, the action i 
degree of uniqueness (the extent to que 
it was uniquely undertaken by the actor vs. 
commonly performed by most people), as- 
sumed by the attribution theorists to be 4 
major basis for internal inferences, should 
not, in fact, relate to internality, Nor may 
the inferences of intrinsic motivation, sub- 
jective freedom, positive attitudes, or sincere 
reporting relate to internality. In turn, this 
implies that the uniqueness variable should 
not be expected to correlate with any of the 
inferences just listed. Experimental evi- 
dence pertinent to this assertion is reviewed 
in a subsequent Section, but now let us turn 


to the explanatory Properties of the endoge- 
nous-exogenous distinction, 


Functional Leve] of Explanation 
Covariance Level) 


13 (from Definition 1). An 


"s goal but will not covary 
An action that consti- 
f is completely expli- 
is end, provided that 
tionally possible (e.g. 
motivated eating could not 
ss food was situationally pres- 
fnt). More Specifically, given both the ex- 
'stence of some goal and the situational pos- 
sibility of its consumption, the endogenously 
Motivated consummatory) action will be 
Performed regardless of other situational 
t affect neither the presence of 

the goal nor the Possibility of its attainment. 
ERIVATION 4 (from Definition 2). 4” 
exogenously attributed action «will covary 
wth a specific conjunction of the goal and 
the situation, By contrast to an endoge- 
nously Motivated action, an exogenously mo- 
tivated one is not fully explicated in to 
of its goa] (and the situational possibility © 


tutes an end ; 
cated in terms of th 
this action was situa 
endogenously 
take place unle 


n itsel 


l 
^ 
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the action). An exogenous action assumes 
a greater degree of situational determina- 
tion: The situational constraints implicate @ 
given means (to some end) to be the best 
available. In Situation A, the means to 
having breakfast" may consist of walking 
to the kitchen; in Situation B, of driving to 
the hamburger stand, etc. 
Y Derivations 3 and 4 above imply a sense 
in which exogenously motivated actions are 
more externally (or situationally) deter- 
mined than are endogenously motivated ones. 
This may explain, perhaps, why attribution 
theorists made the presently denied dis- 
tinction between internal and external goals 
(such as "interest in a task" vs. the “con- 
tingent payment” ) where the endogenous- 
exogenous partition would seem applicable. 
In any event, while endogenously motivated 
actions may, in a sense, be also more purely 
internal than exogenously motivated actions, 
it is the endogenous-exogenots variable 
(rather than the internal-external one) that 
Seems crucial to the several inferences from 
actions that have been demonstrated experi- 
mentally. We now turn to the explicit deri- 
vation of these inferences. 
Derivative Level of Explanation 
Derivation 5 (from Definition 1). To 
the extent that an action is endogenously at- 
tributed, it implies the actor's positive affect 
(enjoyment, contentment, satisfaction, etc.)- 
This follows from the common definition of 
the end concept as a state of affairs that the 
actor desires. Its attainment (an endoge- 
nously attributed action) represents the ful- 
fillment of one's desire, à situation creditable 
with positive feelings of enjoyment and sat- 


isfaction. 
DERIVATION 6 (from Definition 25. “Lo 
the extent that an action is exogenously at- 
tributed, it implies the actor's negative af- 
fect, insofar as means is not something that 
the actor desires. In the case of an exoge 
nously motivated action, the actor is doing 
something that he does not want to do: 
hence he may not be assumed happy at the 
time. : 
Note that the endogenous-exogenous vari- 
able is not assumed to constitute the exclu- 
sive determinant of satisfaction or positive 
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affect, An action is endogenous Or exoge- 
nous with respect to a single goal, while the 
degree of satisfactior _dissatisfaction is as- 
sumed contingent on the totality of the per- 
son's goals or needs : notably, on the degree 
to which they are fulfilled or frustrated. 
Thus, the present Derivations 5 and 6 need 
to be supplemented by a ceteris paribus pro- 
viso, whereby the inferences of satisfaction 
versus dissatisfaction are a monotonic func- 
tion of the endogenous-exogenous variable 
only when the state of the person's remain- 
ing goals (needs) is held at a constant level. 


PosrULATE 3. Subjective freedom consists 
in the attainment of goals. One does not 
freely choose to have needs. These are de- 
termined for the individual by his biological 
constitution, societal norms, ethical injunc- 
tions, etc. But granting the existence of 
needs, freedom consists in need fulfillment 
or goal attainment. 

DERIVATION 7 (from Postulate 3 and 
Definition 1). To the extent that an action 
is endogenously attributed, it affords the in- 
ference of subjective freedom. 

Derivation 8 (from Postulate 3 and Defi- 
nition 2). To the extent that an action is 
exogenously attributed, it affords the infer- 
ence of compulsion. In the case of means 
(vs. end) the actor js not engaging in an 
action that is inherently desirable: He does 
not want to, only he is compelled to do it by 
the existence of his need and by the situa- 
tional constraints that pose barriers in the 
way of immediate need satisfaction. 

The analysis of inferred freedom con- 
tained in Postulate 3 and Derivations 7 and 
g seems closely akin to Steiner’s notion of 
“outcome freedom,” which he defined as 
“high (subjective) probability of obtaining 
desired outcomes” (1970, p. 189). Obvi- 
ously, maximal subjective probability of ob- 
taining à desired outcome is experienced 
upon this outcome's actual attainment, that 
is, in the case of consummatory OT endoge- 
nously attributed behaviors. Steiner's “de- 
cision freedom” is said to obtain when the 
individual may believe that “he rather than 
other people, fate or the press of circum- 
(1970, p. 189) may be credited with 


stances" 
Steiner goes on to illustrate 


his decision. 
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high (vs. low) decision freedom by bris 
istence of an attractive (vs. an unattrac| E 

alternative to the person’s caa ns 
This strongly suggests that “decision ree- 
dom” is also reducible to endogenous attri- 
bution. Thus, an unattractive alieiativë 
implies an exogenous attribution : The per- 
son did not choose the action for its own 
sake but rather to avoid the unpleasant al- 
ternative. Conversely, when the alternative 
is attractive, endogenous attribution seems 
indicated, there being no apparent exogenous 

a for the action. 

a 9 (from Definition D. In 
the case of endogenously attributed actions, 
knowledge of the action’s content immedi- 
ately identifies the action's goal or intention, 
For instance, the knowledge that a benefit 
bestowed on another person was endoge- 
nously motivated yields the immediate in- 
ference that the actor’s underlying 
was benevolence, 

Derivation 10 (from Definition 2). [j, 
the case of exogenously attributed actions, 
knowledge of their content does not reveal 
their underlying intentions. The specific 
means chosen (the content of which is 
known) could, in principle, mediate all pos- 
sible goals, its choice being situationally de- 
termined and not uniquely determined by the 
underlying goal, 

This concludes the present statement of 
the theory of endogenous attribution. The 
theory will now þe applied to the several 
areas of empirical research pertinent to 
lay explanation of actions, 
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in a task) as generically internal. The sec- 
ond criterion assumes that the attributor 
systematically applies the uniqueness stan- 
dard to arrive at internality designations: 
That cause is considered internal whose ef- 
fect is unique to the actor versus being com- 
mon to many persons (Kelley, 1967). E 

The present theory of endogenous attri- 
bution suggests that in explaining actions 
the layman draws the endogenous-exoge- 
nous distinction between the actions’ causes 
(reasons) rather than the internal-external 
distinction. In the following review of at- 
tributional research on actions, the case for 
the present theoretical scheme will rest on 
both conceptual and empirical considerations 
that favor the former over the latter parti- 
tion. On the conceptual level, two difficul- 
ties with current applications of the internal- 
external framework will be identified: the 
problems of (a) criterion and (b) inference. 
The problem of criterion is that the two 
ways of rendering internality-externality 
attributions implied earlier (the ideas of 
generically internal and external causes and 
of the uniqueness standard) do not appear 
to coincide, as might be hoped for, but rather 
seem to be mutually independent. Thus, any 
Senerically internal cause could be implicated 
as external on grounds of nonuniqueness, 
and any senerically external cause as in- 
ternal. By contrast, the Conceptual criterion 
for the endogenous-exogenous distinction is 
relatively straightforward, as it rests on the 
common differentiation between means and 
ends, 

The problem of inference is that in typical 
applications of the internal-external frame- 
Work, it is neither explicitly elaborated go 
is it otherwise plain how the attributions 9 
internality (externality ) may imply the vari- 
ous inferences with which they have beet 
credited. p. contrast, the present Deriva- 
tions 3-10 Specifically deduce the various pi 
ferences afforded hy the endogenous-exost 
nous classification, g 

The main em 
Port of the 
ments that 
wh 


"— 
pirical considerations i. es 
present theory include the ne 
(a) in all those i aine 
ere the subjects? inferences were co mä 
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the theory of endogenous attribution; (b) 
specific experimental attempts to vary the 
internal-external factor (either by manipu- 
lating the uniqueness variable or the generi- 
cally internal or external causes) failed to 
yield the inferences claimed to be implied 
by this partition ; and (c) experiments that 
specifically varied the endogenous-exoge- 
nous factor yielded results that unexception- 
ally corroborated the present theory. 

Let us now directly turn to attributional 
research on the lay explanation of actions. 
The specific topics to be reviewed include 
(a) intrinsic motivation, (b) subjective 
freedom, (c) benevolent and malevolent in- 
tentions, (d) inferred sincerity, and (e) es- 
cape behavior. 

Intrinsic motivation. Several recent ex- 
periments have demonstrated a negative re- 
lation between the existence of contingent, 
frequently monetary, reward and the sub- 
ject’s degree of intrinsic motivation for 
the task (eg., Deci, 1971, 1972a, 1972b; 
Greene & Lepper, 1974; Kruglanski, Alon, 
& Lewis, 1972; Kruglanski, Friedman, & 
Zeevi, 1971; Kruglanski, Riter, Amitai, 
Margolin, Shabtai, & Zaksh, 1975; Lep- 
per, Green, & Nisbett, 1972). These find- 
ings have been sometimes interpreted to 
mean that the actor infers his own intrinsic 
motivation when he attributes his perform- 
ance of the task internally (cf. Deci, 1974), 
for example, to his interest in the task. The 
inference of intrinsic motivation was as- 
sumed less likely in the reward versus the 
no-reward condition, because the former case 
but not the latter also contains plausible ex- 


ternal reasons for task performance, that is, 


the contingent rewards. Such analyses as 
the above have usually failed to clarify the 
criterion for designating the contingent re- 
ward (e.g., the money) as the external cause 
of the action. Rather, in these cases the 
cause's generic nature Was somehow intuited 
to belong in the external category. Alterna- 
tively, the widespread appeal of money could 
justify its alleged externality on grounds of 
its nonuniqueness as à motivator. It should 
be recognized, however, that the two criteria 
just implied are orthogonal and may not be 
expected to manifest unexceptional corre- 
spondence. For instance, à person could be 
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uniquely greedy and in return for money 
undertake a task rejected by everyone else. 
From the uniqueness perspective this would 
establish money as an internal cause of this 
person's action, that is, contravert the sug- 
gestion that money is generically external. 

Furthermore, the various analyses of in- 
trinsic motivation in terms of the internal 
attribution fail to cogently rationalize the 
implied relation between the two. Yet, 
counterexamples easily come to mind. For 
instance, our greedy individual above would 
seem to be internally motivated (on unique- 
ness grounds), yet would seem quite un- 
likely to be credited with a great deal of 
intrinsic motivation toward the task that he 
performed for money. The generic standard 
for internality-externality is similarly sus- 
ceptible to counterexamples with respect to 
the inferences of intrinsic motivation, On 
this criterion money would represent a typi- 
cally external cause, $0 that behaviors 
prompted thereby should lead to the infer- 
ences of extrinsic motivation expressed, for 
example, in disenjoyment or dissatisfaction. 
But acts to which the monetary rewards are 
inherent, such as playing poker, perform- 
ing a business transaction, or receiving one’s 
paycheck, are certainly attributable to mone- 
tary causes without implying disenjoyment. 

The last examples naturally lead to the 
present conceptual framework, whereby the 
inference of intrinsic motivation follows from 
an endogenous attribution (Derivation 5" 
Whenever the actor's performance of his 
task appears endogenously motivated (i.e. 
task performance appears to be an end in 
itself) the implication of enjoyment, interest, 
etc. derives from the notion of goal attain- 
ment (consummatory behavior). By con- 
trast, when there exists a plausible exoge- 
nous reason for task performance (the ac- 
tor's reason is not inherent to the task but 
is made arbitrarily contingent on task per- 
formance), the attributor will partially dis- 
count task-endogenous reasons; hence the 
inference of enjoyment, interest, etc. will be 
less likely to follow. Thus, the present anal- 
ysis suggests that the inference of extrinsic 
motivation in the reward condition does not 
follow from (a) the fact that money was the 
zoal of one's activity nor (b) the fact that 


> 
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'TABLE 1 


AVERAGE DEGREES OF INTRINSIC MOTIVATION 
E (ExPERIMENT 1) 


Money endo- — Money exo- 
Condition genous genous 
Payment present 5.44 4.66 
Payment absent 2.25 5.60 


i denote a greater degree of intrinsic moti- 
AR From “Can Money Enhance Intrinsic 
asco A Test of the Riter A, Comit qe Hypothesis 
i K d. A. - Margolin, L, 
ENDE E l of Personality and Social Psy. 

i, and D. Zaksh, Journal of Personality a ti 

PEN. 1975, 31, 744- by the American 
Psychological Associati ission. 


the goal was nonunique; but rather that the 
activity was exogenously attributed, that is, 
was a means, rather than an end in itself, 
Recently, this author has completed two 
experimental Sequences of direct relevance 
to the discussion above. 


money, a cause 


al to the person, 
activity, in which 


of intrinsic Motivation is grea 
effect (action) js attributable 
rather than exogenously, 
sider at some length the experimental eyi- 
dence for these Propositions, 

In the first experimental 


endogenously 
Let us now con- 


monetary rewards were 
ask content 
while in the 
ated to the task 
). Within each 


condition), 
second case they were unrel 
(money-exogenous condition 
of these conditions half of the subjects per- 
formed the task in return for monetary re- 
wards and the remaining half for no re. 
wards. Specifically, in the money-endoge- 
nous condition of Experiment 1 the subjects 
played with the experimenter 4 Coin-toss 
guessing game (heads or tails). Ag this 
game is usually played for actual money, it 
had been assumed (and confirmed on subse- 
quent checks) that money would p 
hended as a Same-endogenougs feature. 
the money-exogenous condition i] 


y ) 1€ subjects 
played with the experimenter 4 game of 
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model construction, an activity hardly d 
associated with money. In each of the - 
conditions half of the subjects receive 

monetary rewards for correct performance 
(the payment-present conditions), and the 
remaining half received points (payment- 
absent conditions), The degree of intrinsic 
Motivation was inferred from the appropriate 
index based on the subjects' reported inter- 
est in the game and preference for it over 
Possible alternatives. The pertinent data 
are displayed in Table l. As shown, when 
money is endogenous to the task its presence 
enhances intrinsic motivation (p < .001), 
Whereas when it is exogenous to the task its 
presence depresses intrinsic motivation 


.05)—this in contrast to the implication that 
intrinsic motiy; 


appears in evidence, 
The confidence in the 


above results is aug- 
mented by the 


findings of Experiment 2, 
where the money-endogenous and money- 
exogenous tasks were less dissimilar than 
their counterparts in Experiment 1. Spe- 
cifically, in both Same conditions of Experi- 
ment 2 the actual activities were 
identical, only they were interpreted differ- 
ently: as stock market Operations in the 


Conditions and as ath- 


in the money-exogenous 
in Experiment 1, the sub- 


letics management 
i As 


TABLE 2 
GAME PREFERENCES AND RATED INTEREST 
(EXPERIMENT 2) 
——————— — 
n 
Money en- Money ex- 
Measure dogenous ogenous 
e emo A 


Proportions of subject 
Preferring the same 
vs. another) game 


s 


Payment present 


65 35 

Payment absent 35 5 
Average rated interes; 

Payment present 5.75 443 

ayment absent 4.31 5.30 


Note. Higher figures de i d interest on a ^; 
Point scale, From “4 an Money higher rated int Mote 
wre of the Content Consequence Hypothesis" by i and 
uslanski, A. Riter A souital, B. Nargolin, L. Shabtal, ant! 
ispoUtnal of Personality bs; Social Psychology, PE 
opyright 1975 by the American Psyc 
printed by permission, 


E 744-750. 
Ass ‘Ociation. Re 


ATTRIBUTION THEORY 


jects’ successful performance (accumulation 
of stock or athletic credits) was rewarded 
by money in the payment-present conditions 
and by points in the payment-absent condi- 
tions, Intrinsic motivation was inferred 
from (a) the proportions of subjects who 
preferred to continue with the same game 
versus trying an unknown alternative, and 
(b) interest and enjoyment ratings. The 
relevant data are shown in Table 2. As ex- 
pected, a significantly greater proportion of 
the subjects preferred the money-endogenous 
game in the payment-present condition (P 
< 01) and preferred the money-exogenous 
game in the payment-absent condition ($ < 
02). Similarly, the money-endogenous 
game was rated as significantly more inter- 
esting in the payment-present condition (f 
< .01), whereas the money-exogenous game 
was rated as more interesting in the money- 
absent condition (p «.05). In sum then, 
the findings of both experiments are consist- 
ent with the present theory of endogenous 
attribution (specifically with Derivation 
and inconsistent with the idea that intrinsic 
motivation follows from internal attribution 
and is lowered by the presence of a generi- 
cally external cause such as money. 

'The second series of studies (Kruglanski, 
Note 1) examined the uniqueness standard 
for internality and the endogenous-exoge- 
nous variable as they may affect the infer- 
ences of intrinsic motivation and freedom. 
In the first experiment the subjects were 
furnished with information regarding the 
target person's uniqueness on some charac- 
teristic, In one half of the cases the target 
person was presented as unique on that char- 
acteristic and in the remaining half of the 
cases as nonunique. The target person was 
then portrayed to have acted in some man- 
ner, such that in one half of the cases the 
individual's unique or nonunique character- 
istic suggested an endogenous attribution, 
while in the remaining half of the cases it 
suggested an exogenous attribution. 

Thus, in one of the four hypothetical situ- 
s presented to the subjects, the actor, a 


ation s 
high school student, decides to attend a Sun- 


day concert performed by an out-of-town 
classical ensemble. In the endogenous con- 
dition the actor is known for his delight in 
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TABLE 3 


ENJOYMENT AND FREEDOM RATINGS 
(EXPERIMENT 1) 


Condition 
Endogen- Exogen- 
Measure ous ous 
Inferred enjoyment 
Internal condition 6.38 4.00 
External condition 6.31 3.87 
Inferred freedom 
Internal condition 6.57 3.88 
External condition 6.15 3.88 


Note. The higher the figures, the greater the inferred enjoy- 
ment or freedom. From Kruglanski (Note 1). P 


classical music, which is either unique to him 
or is common among his fellow students. 
In the exogenous conditions, Ann, an at- 
tractive new girl, is also known to attend 
the concert, and the protagonist has an in- 
terest in Ann that is either unique to him 
or is commonly shared among the boys at 
his school. The subjects responded to four 
hypothetical situations in accordance with a 
balanced Latin square design, where each 
situation represented one of the four possible 
combinations of the endogeneity and the 
uniqueness variables. After reading each 
text, the subjects answered a brief ques- 
tionnaire that included checks on the unique- 
ness manipulation and questions about the 
actor's perceived enjoyment of the action 
(e.g. listening to the concert) and his sub- 
jective freedom while pursuing it. It will 
be noted that the uniqueness manipulation 
was highly effective: The subjects rated the 
protagonist's actions as more distinctive of 
his person in the unique (vs. the nonunique ) 
condition (? € 001). The enjoyment data 
(collapsed across the four situations) are 
displayed in the upper row of Table 3. As 
shown, greater enjoyment of the activities 
was reported in the endogenous (vs. the 
exogenous) conditions ( € 001), while the 
uniqueness manipulation yielded no effects 
with respect to task enjoyment. 

The possible argument could be raised 
that the uniqueness variable did not affect 
the results of Experiment 1 because the 
actor’s reason for his behavior was clearly 


identified. Verhaps uniqueness may signify 
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TABLE 4 
ENJOYMENT AND FREEDON RATINGS 
(EXPERIMENT 2) 
Condition 
Endo- Exogen- 

Measure Internal External genous ous 
Inferred ) 
E oum 5.37 5.0 645 3.87 

ferred 
oris 3.594. 5.50 6.36 4.29 


i i he inferred enjoy- 
te. The higher the ratings, the greater t 
aom freedom. From Kruglanski (Note 1). 


person's specific 
reason for his action is not otherwise im- 


The foregoing Possibility was ex- 


exogenous. Addi- 
tional information Was provided that sug- 
gested the actor's reason to be either (a) the 
(b) the exogenous one, 
actor, or (d) nonunique 
to him. (Note that in the latter two Cases 
the sole information Provided was about the 


uniqueness of the action without the specific 
Teason being identified in any way.) 

hus, in one of the hypothetical situa- 
tions i 


for dessert (endogenous reas 
order), exactly 
moment ago 
order), 
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Properties). (d) 
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summarized in the first row of Table 4. 


Again, the endogenous-exogenous er 
strongly affected the results but not the 


uniqueness variable. In particular, the a 
tor’s inferred enjoyment was (a) higher in 
the endogenous case than in either the in- 
ternal or the external cases ($ < 01), (b) 
lower in the exogenous case than in either 
the internal or the external cases (p < .05), 
and (c) no different in the unique and the 
nonunique conditions. 

In Experiment 3 : 
assessed the information value for enjoy- 
ment and freedom 
Posing the choice between (a) endogenous 
versus exogenous 
Sonal versus 


stance, Kelley (1967 
movie viewer who wonders whether 


„1€ Present case, One of th 
thetica] situations used inE 
trayed a Protagonist, X, 


attend movie In the Personal versus en- 


€ four hypo- 


' Persona] Preferences or with the 
Properties, In the endogenous- 


TABLE 5 
Raten INFORMATION VALUE OF INTERNAL-Ex- 
TERNAL VERSUS ENDOGENOUS-Exocrnovs 
HOICES FOR ENJOYMENT AND 
REEDOM INFERENCES 


(EXPERIMENT 3) 
Se hl. 
Se ete 


Condition 


Endogenous- 


Measure pm exogenous 
f c d 
Enjoyment P 
inf 3.62 5.12 
inference :62 
"reedom 
inference 3.54 5.41 


Note. The hi her t i 
forti The hi T the figures, 


hi ived in- 
F the greater the perceived 
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exogenous version the question to be as- 
sessed was of whether X’s reason for at- 
tendance had to do with the movie’s prop- 
erties or with the fact that his boss had 
invited him to come along for this movie. 
Each of the three hypothetical situations 
Was presented to the subjects twice in a 
counterbalanced order: once with the per- 
sonal-environmental question and once with 
the endogenous-exogenous one. The perti- 
nent results (collapsed across the three situ- 
ations) are displayed in the upper row of 
Table 5. As shown, the subjects rated the 
endogenous-exogenotts question as signifi- 
cantly more informative for enjoyment in- 
ferences (p < 01) than they did the per- 
sonal-environmental question. 

Finally, relevant to the present concern is 
the “forced-compliance” study by Cooper, 
Jones, and Tuller (1972), where uniqueness 
versus nonuniqueness of a person's compli- 
ance to deliver a counterattitudinal essay on 
some issue was orthogonally crossed with 
the magnitude of exogenous reward prom- 
ised him for doing so. It turned out that 
a more positive attitude on this issue Was 
expressed by subjects in the case of a small 
exogenous reward (versus a large one), that 
is, where endogenous attributions would 
seem more likely. By contrast, the act's de- 
gree of uniqueness had no appreciable effects 
on attitudes. 

Subjective freedom. It has been argued 
that freedom or volition of the actor may be 
inferred from an internal attribution, that is, 
to the extent that the act may be attributed 
to the person uniquely. For example, Kelley 
(1967, p. 218), suggested that *you can be 
said to exercise choice only if you do some- 
thing different from what everyone else 
would do in the same situation." But it 
would seem that some nonunique choices 
may appear (and be experienced) as quite 
free, and some unique ones as unfree. For 
instance, Suppose that a millionaire, no- 
torious for his eccentricity (say Howard 
Hughes ) approached you one day with the 
free gift offer of $1,000,000. Most of us 


would probably hasten to accept the gift 
without feeling unfree in the least. On the 
other hand, a person's unique fear of au- 


thority might prompt him to uniquely abide 
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by unreasonable regulations, unheeded by 
anyone else, and experience a strong sense 
of compulsion while doing so. 

On the level of empirical evidence, Krug- 
lanski and Cohen (1973) claimed specific 
support for the uniquely personal rendition 
of perceived freedom. But a closer scrutiny 
of these authors’ procedures suggests that 
their findings may better be explained by 
reference to the endogenous-exogenous dis- 
tinction. Kruglanski and Cohen (1973) 
operationalized person causation by the ex- 
perimental condition where the target per- 
son's act (choice of an essay topic) was 
consistent with his unique character versus 
being inconsistent with his character in the 
external-causation case. Subjects attributed 
significantly greater freedom to the target 
person in the in-character versus the out-of- 
character condition. 

But rather than viewing the in-character 
condition as one that operationally defines 
uniquely-personal causation, and the out-of- 
character condition as one that defines the 
absence of unique causation, it is possible to 
interpret the former as implying an endoge- 
nous attribution and the latter an exogenous 
one. Thus, in the in-character condition, 
knowledge of the protagonist’s preferences 
suggests that the act produced (choice of 
an essay) was endogenously motivated, 
whereas in the out-of-character condition the 
same prior knowledge suggests that the act 
may not have been its own end; hence some 
aspect exogenous to this act must have been 
the reason for its performance. Thus, in the 
Kruglanski and Cohen experiment, uniquely- 
personal causation was confounded with en- 
dogenous attribution, and either conceptuali- 
zation could explain the findings. But in 
the experimental series by Kruglanski (Note 
1) described above, freedom inferences (as 
well as those of task enjoyment) were sig- 
nificantly (^ < .01) affected by the endoge- 
nous-exogenous distinction but not at all 
affected by the uniqueness variable (Experi- 
ments 1 and 2). Furthermore, questions 
posing the endogenous-exogenous choice 
were rated as significantly ($ < .01) more 
informative for freedom inferences than 
questions posing the internal-external choice. 


The data relevant to the above assertions 
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are displayed in the lower rows of Tables 3, 
4, and 5. Finally, it is of interest to note 
that within Experiments 1 and 2, the ratings 
of freedom and enjoyment exhibited a high 
degree of positive correlation, as did the in- 
formativeness ratings for freedom and en- 
joyment inferences in Experiment 3 (P< 
-01 in all cases), a finding consistent. with 
the idea that freedom and enjoyment infer- 
ences follow from the same (presumably) 
endogenous attribution. 

To conclude, then, both Previous results 
and i 


vation inferences, 

Benevolent and malevolent intentions, The 
rendition of intentions as bene 
levolent has been analyzed p 
the internal-external distinction, 
cal experiment, the actor 
or a harmful deed (a) 
where an external reason ; 
in the 


volent or ma- 


uded the Status of 
& Riecken, 


experimenter's instruction to per- 


rule; a f 


Person upon Whose 
) +. an inadequacy 
in the prior behavior of the person Subjected 
to the frustration” (Kelley 

It turns out (see K 
view) that in the ab 
reasons, a helpful o 
greater degree of i 


target person, as been inter- 


that internally attributed 
benevolent or malevolent acts implicate be- 
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nevolence or malevolence as their underlying 
intention, e 

The diversity of external reasons illus 
trated above raises again the question of Cs 
terion: What common aspect of these apo 
ent justifications may account for their Es 
ternality (or the internality of others). 
Little specific discussion of this problem ex- 
ists in the literature, but it is possible to 
apply here our two earlier Wap gto 
whereby the attributor may apply either ud 
intuitive criterion of generic internality (ex- 
ternality) or the more systematic standard 
of uniqueness, It is worthwhile to reiterate 
that the two interpretations need not coin- 
cide. For instance, a person could be par- 
ticularly Sensitive to social status, and hence 
$9 to unique pains to please a high-status 
This would Suggest internal causa- 


S of uniqueness, even though 
the cause in question (status) has been cus- 
tomarily referred to as external (cf, Thibaut 
& Riecken, 1955) 


1 respect to the in- 
ternal-externa] partition : Internal attribu- 
tion has been assumed to mediat 
Tocation of one’s helpful or harm ts, 
it seems unlikely that an actor’s 
actment of 4 benevole 
in much reciprocation 
that deed is assignable 
usual craving for Status, 


In the empirica] domain, inexplicable in 
terms of the uniqueness principle is the per- 
sistent finding that less reciprocation follows 
when the actor's helping behavior seems in- 
appropriate, hence Suggesting “that the actor 
may be peculiar, stupid, or not completely 

197 1a, p. 16). Clearly, in- 
al behaviors (whether 
or malevolent) are ipso facto 
hence should increase the prob- 
ability of interna] attribution, yet low recip- 


Cation resulted, Kelley viewed such find- 
Ings as "exception[s] to the rule that helpful 
or affiliative behavior attributed to the per- 


t more liking than if attributed 
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viewed as exceptional. Let us, therefore, 
consider the findings on reciprocated harm 
or benefit in light of the latter distinction. 

It is presently proposed that the presence 
of justifying reasons represents the case 
wherein the act (the benefit or the harm) 
may not be readily assigned endogenously, 
but rather may be perceived as a means to- 
ward the attainment of an exogenous goal 
(the justifying reason). Now, when the 
act's content is known to be benevolent or 
malevolent, an endogenous attribution im- 
plicates the actor's intention as benevolence 
or malevolence (Derivations 9 and 10). It 
is the latter inference of intention that may 
elicit reciprocation from the acts target. 
The above analysis suggests, for example, 
that the benevolence inferences and recipro- 
cated benevolence (assumed to follow from 
an endogenous attribution of a benevolent 
action) ought to be correlated with the in- 
ferences of the actor's freedom (assumed to 
follow from an endogenous attribution of 
any action). Indeed, in at least one experi- 
ment such correspondence was reported. 
Specifically, Thibaut and Riecken (1955) 
found that when a helpful act was performed 
in the absence (vs. the presence) of an ex- 
ogenous reason (approval by a high status 
figure) the actor was (a) credited with 
greater freedom (was perceived as wanting 
to rather than being coerced to help) and 
also was (b) liked more (reciprocated be- 
nevolence). 

As noted earlier, the uniqueness criterion 
for internality confronts a special difficulty 
in the case of irrational behaviors (or those 
associated with a psychological defect in the 
actor). Such behaviors are highly unique 
(which implies internality) yet have typi- 
cally resulted in low reciprocation—reserved 
for externality. Therefore, of particular in- 
terest is the possibility that the theory of en- 
dogenous attribution may handle the case of 
irrational or pathological acts. Specifically, 
the irrational actor could misperceive the 
situation and include in his evaluation a 
variety of exogenous justifications evidently 
absent for a more rational perce? (es. 
the pa ranoiac may inappropriately iie « 
the other person as boding him ill; such * 
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evaluation could then provide a possible ex- 
ogenous reason to which his aggressive acts 
could be attributed). Similarly, when per- 
iormance of a given act seems clearly in- 
appropriate, the actor could be suspected of 
misperceiving the meaning of this act. This 
would render inapplicable Derivations 9 and 
10 of the present theory, thus delimiting the 
implications of endogenous attribution to the 
inferences of freedom and motivation to per- 
form the act as such. But in the absence of 
clarity that the actor understood the act as 
benevolent or malevolent, an inference that 
the actor's intention was to bestow benefit 
or harm is not unambiguously indicated. 

To sum up, then, it has been argued that 
the data on reciprocated harm and benefit 
are accountable for within the endogenous- 
exogenous framework and not readily ex- 
plicable in terms of the uniqueness variable 
or the internal-external partition as such. 
An endogenously motivated favor or harm 
implicates not only the actor’s volition (as 
in the Thibaut & Riecken, 1955, study) but 
also his intention as benevolence or malevo- 
lence, inferences which, in turn, may me- 
diate reciprocal behavior from the act's 
target. 

Inferred sincerity. In a widely known 
argument, Daryl Bem (1965, 1967) ac- 
counted in attributional terms for the nega- 
tive relation between the magnitude of in- 
ducement for making a counterattitudinal 
statement and subsequent attitude change 
toward the statement's content (cf. Festin- 
ger & Carlsmith, 1959). To that end Bem 
employed Skinner's (1957) distinction. be- 
tween tact and mand, where tact has been 
defined as “a verbal response . . . under the 
discriminative control of some portion of the 
environment" and mand "as a verbal re- 
sponse under the control of a specific rein- 
forcing contingency" (Bem, 1965, p. 200). 
When the magnitude of inducement is high, 
the reward contingency is clear, hence a 
mand inference should be made, which im- 
plies that the statement may not have re- 
flected the (subjectively) true properties of 
the entity about which it has spoken. By 
contrast, with a low inducement a tact in- 


400 


ference is more likely, and it should afford 
the inference of a sincere report. — ] 
Note that the tact-mand distinction dif- 
ferentiates between portions of the (exter- 
nal) environment and assumes a causal ex- 
planatory scheme. In other words, a per- 
son's utterances are assumed to be caused by 
various parts of the environment or to con- 
stitute responses to environmental stimuli. 
This contrasts with the teleological account 
of actions (including utterances) proposed 
herein. But from a different vantage, there 
exists an important similarity between the 
tact-mand and the endogenous-exogenous 
distinctions. Tacting refers to utterances 
prompted by endogenous aspects of the state- 
ment's object, while manding implies utter- 
ances prompted by goals (e.g., reward con- 
tingencies) exogenous to the 
object. Now in the same way , end 
genously motivated benefit or harm implies 
the actor's end to be benevolence or malev- 
olence, so endogenously motivated reports 
may implicate sincerity (“telling it like it 
is") as their underlying goal (Derivations 
9 and 10). In other words, a report is a 
description of some reality, and an endo- 
genously motivated Teport implies that the 
actor’s sole objective is to describe a reality, 


that is, Teport on what to him appears to be 
the case, 


Statement's 
that endo- 


erent theme looms 
prominent in Kelley’s (1967) interpretation 
of the relation between inducement and atti- 
tude change, Kelley viewed as particularly 
significant the fact that “strong reinforce- 
ment contingencies Suggest the notion that 
most or all persons in the sam 
would make a response” (1967 


also emphasized the idea that 


€ situation 
| P. 217), He 


the judgment that the 
ment] complier js mori 
[high inducement] com 
with assumptions that 
opinion toward the tas 
vorable subjects complied in the [low ind 
case and almost all, favorable Or not, cor 
the [high inducement] case, (p. 226) 
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The foregoing analysis has two implications . 
1. The distinction Proposed therein is not 


between entity versus extrinsic Teward (tact 
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vs. mand), but rather between internally 
versus externally assigned causality as ^e 
lined from the uniqueness perspective. T d 
low-inducement case is assumed to suggest 
an internal attribution, whereas the high- 
inducement case is assumed to suggest 
an external attribution (as everyone in the 
same situation is assumed to comply). i 

2. Within the above analysis, internal or 
unique attribution of an utterance is assumed 
to reflect the speaker's sincerity or favora- 
bility to his pronouncement, 


But the 
attribution 


proposed linkage between internal 
and the inferences of sincerity 
of favorability has not been furnished with 
an explicit rationale, nor does it appear to 
be intuitively evident. On the contrary, the 
consensus criterion of attribution validity 
(Kelley, 1967) suggests that unique pro- 
nouncements (those that deviate from the 
consensus) would be considered false, thus 
(possibly ) insincere, Similarly, 

(hence, internally based) compliance need 
not reflect the actor’s favor 
deed but could, 


signify his 
Special craving for 


Sent reward, or 
his unique fear of the contingent punishment, 
Indeed, 


the above-mentioned 
Cooper et al. (1972), and by 

(Note 1) has failed to detect a relation 
between the uniqueness of an act and the 
actor's inferred favorability thereto, 

As another example of the subjects! al- 
leged reliance on the | internal-external 
Scheme in arriving at Sincerity judgments, 
Consider the experiment by Jones, Davis, 

(1961), In this study, a ficti- 

iewee for a job described himself 
that was either consistent or 
e job requirements. It 
Subjects were more prone to 
rViewee's statements in the 
J € former case, Tn interpret- 
Kelley Suggested that the at- 

s behavior was prompted by an in- 
" by an external one—the de- 

: acteristics of the job interview 
Situation, Kelley further argued that in 
those conditions Where the job applicant had 
Portrayed himself inconsistently with the job 
requirements “the behavior should be attrib- 


research by 
Kruglanski 


turned out that 


uted to the person himself and from it his 
properties should be inferred" (1971a, p. 9). 
But such an interpretation suffers from the 
by now familiar difficulties of the internal- 
external partition. First, the classification 
of role demands as external to the person 
lacks explicit rationale. For instance, com- 
pliance with role demands could be unique 
(hence, internally based) because of some 
actor's unique craving for role-associated 
rewards, Second, the suggestion that in- 
ternal (or unique) attribution leads to the 
inferences of sincerity is similarly unexpli- 
cated. As noted earlier, the uniqueness of 
an actors statements (ic. his deviation 
from consensus) could well intimate a de- 
ceptive intent on his part, rather than sin- 
cerity. 

A quite simple account of the Jones et al. 
(1961) results is possible in terms of the 
present theory of endogenous attribution : 
Behaviors inconsistent with the job require- 
ments are more readily attributable endoge- 
nously than behaviors consistent with the 
job requirements, which suggest the addi- 
tional possibility of an ulterior (exogenous ) 
purpose. Furthermore, characterization of 
the behavior in question as a self-report 
(ie, a description of some reality about 
oneself) allows, in the case of endogenous 
attribution, identification. of the speaker's 
specific intentions as that of describing one- 
self sincerely (Derivations 10 and 11). 

It is of more than curious interest that 
the situation concocted by Jones et al. (1961) 
easily lends itself to the mistaken uniqueness 
interpretation. for two incidental reasons: 
(a) The interviewee is making a statement 
about himself; in other words, endogenous 
aspects of the statement's content concern his 
unique person; and (b) the exogenous re- 
ward (getting the job) is something that 
appears to be of high general desirability 
(i.e. suggesting à nonunique and hence ex- 
ternal attribution). Thus, within this situa- 
tion endogenous attribution could be also 
characterized as attribution to the unique 
5 “person” constitutes here an kl 
ct of the statement's object 


atement is about the person). 
that the sin- 


person, à 
dogenous aspe 
(simply, the st 


Jut it is important to realize 


ATTRIBUTION THEORY 401 


cerity inference follows from attribution to 
the person as endogenous aspect of the state- 
ment’s object, not to the person of the 
speaker as such. Similarly, in this instance 
the exogenous reward (lowering the confi- 
dence in sincerity) is confounded with an 
implied external attribution. However, the 
crucial factor is that the attribution is ex- 
ogenous not that it is external (or non- 
unique). 

To recapitulate, the theoretical implication 
suggested by the preceding discussion is that 
verbal descriptive statements attributed en- 
dogenously are interpreted as sincere Or 
truthful. To the extent that an exogenous 
element of the situation (e.g. a contingent 
reward or threat) appears to constitute the 
statement's end, the statement will be inter- 
preted as less sincere. As in Our discussion 
of acts classified as beneficial or harmful for 
the recipient, so in the case of reports; en- 
dogenous attribution implies more than the 
actor's freedom and satisfaction with the be- 
havior (see Derivations 9 and 10). In this 
instance, characterization of the behavior as 
a report (or a description of a reality) al- 
lows one to infer the endogenous reason for 
such behavior, namely, one of reporting 
what, to the speaker, appears to be the case. 
It is precisely this latter assumption that 
may allow the interpretation of endogenously 
prompted statements as sincere. 

Escape behavior. In an experiment by 
Bandler, Madaras, and Bem (1968), sub- 
jects in one condition were led to believe 
that their escape of electric shock occurred 
under free choice, while subjects in another 
condition were induced to think that their 
escape Was mandatory. Subsequently, indi- 
viduals in the first category (free choice) 
rated the shock as more painful than did 
their counterparts in the second category. 
Nisbett and Valins (1971) accounted for 


these data in terms of the internal-external 
ce condi- 


division, arguing that in the choi 
tion, "it was expected that the subjects 
would interpret their escape behavior as 
self-produced : subjects should infer from 
this behavior that the shocks were indeed 
rather painful” (p. 2)- By contrast, in the 
no-choice condition, escape behavior is pre- 
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d to be under external (or environ- 
Pe) control (p. 3). But again the = 
ceptual difficulties with the lei ge 
distinction are vivid: (a) The unclarity o 
the criterion for classifying the experimen- 
ter’s demands (rather than, say, the shock) 
as external; (b) the possibility of the experi- 
mental demands providing a unique (hence, 
internal) motivation for a uniquely demand- 
sensitive actor ; and (c) the lack of apparent 
rationale for deducing the inference of pain- 
fulness from the attribution of internality. 
Thus, our demand-sensitive individual could 
uniquely (hence, internally) escape a shock 
because of someone’s request (unheeded by 
anyone else), yet no inference of pain is 
likely to be made in such an event. 

Within the present conceptual framework, 
escape in the choice condition can be co- 
ordinated with the notion of endogenously 
motivated activity : Cessation of the shock is 
an end in itself. By contrast, in the no- 
choice condition escape is less clearly its own 
raison d’étre, as there also exists the (ex- 
ogenous) experimental instruction to escape. 
Furthermore (as Derivations 9 and 10 sug- 
gest), knowledge that the act’s content was 
cessation of shock, allows, in the case of en- 
dogenous attribution, the inference that the 
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in Definitions 1 and 2 
an endogenous attriby 
action is considered 
and an exogenous 


s analysis is an 
given earlier, w 
tion is made wi 
to be its own 
attribution js mad 


chored 
hereby 
hen the 
Teason, 
€ when 


Arie W. KnUGLANSKI 


the reason is foreign (exogenous) to the 
action. The above definitions contain three 
distinct components : the action, its cause (or 
reason), and the relation (of endogeneity or 
exogeneity) assumed to connect the two. 
Correspondingly, the conditions of endoge- 
nous (exogenous) attribution are naturally 
classifiable into those affecting the action s 
Perceived identity, the action's perceived 
causality, and the reason's perceived endoge- 
neity to the action. In what follows, the 
causality category is considered first. This 
class of conditions may already claim Some 
empirical Support, whereas the remaining 
two categories (identity and endogeneity ) 
constitute at present novel theoretical pro- 
posals intended as heuristics to further in- 
quiry. 

The causality question, 
causal assignment is a general one within the 
attribution process and is not unique to the 
endogenous-exogenous distinction. This is 
the problem of choosing 
sal hypotheses advanced to expl 
fect. The attributor's 
hypothesis may be regulated by severa] prin- 
ciples identified by Kelley (1967, 1971a), in 

unting principle and that 
ia attributional criteria. 
iscounting principle, the 
confidence in any causal hy- 
pothesis ig inversely related to the number 
of hypotheses entertained, Thus, confidence 
1m an endogenous explanation will be higher 
in the absence (vs. the presence) of addi- 
tional, exogenous explanations of an action. 
Indeed, all of the findings reviewed earlier 
may be interpreted to illustrate the workings 


of the discounting Principle. In the basic 
experimenta] paradigm the actor in one con- 


ition appears to have only an endogenous 
reason for his behavior, while in another 
Condition he seems to have an exogenous 
reason as well, The recurrent finding that 
inferences deducible from the notion of en- 
dogenous attribution are stronger in the 
former (ys. the latter) case support, there- 


fore, the discounting principle of causal as- 
Signment, 


The discussi 


tion in the nur 


The problem of 


; ia- 
on thus far pertains to in 
; ; ses. 

nber of plausible hypothe 
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By contrast, given a constant number of hy- 
potheses, their relative validity may be as- 
sessed by reference to attributional criteria 
of corroborating evidence. More specifically, 
corroborating evidence for a given causal hy- 
pothesis could assume, for example, the 
forms of consensus information, consistency 
over occasions or consistency over modality 
(Kelley, 1967). Alternatively, corrobora- 
tion of a given hypothesis could be achieved 
via the convergence of several independent 
indicators (see, e.g. Ajzen & Fishbein, 1975; 
Campbell, 1963). For example, consensus 
information was manipulated by Kiesler, 
Nisbett, and Zanna (1969) in such a way 
that pronouncements by a confederate re- 
garding the reason for his behavior resulted 
in similar attributions by the subject. Con- 
sistency (over occasions) figured in the ex- 
periment by Kruglanski and Cohen (1973) 
reviewed earlier, where knowledge of the 
actor’s past preferences implicated a given 
reason as a plausible explanation of his cur- 
rent action. Finally, the convergence of in- 
formation could have been involved in the 
study by Nisbett and Schachter (1966). 
These investigators found that the judged 
intensity of shock-produced pain can be ma- 
nipulated by providing the individual with 
an alternative explanation for the effect 
(physiological arousal) he was experiencing. 
As it turned out, this effect obtained only 
in the low-fear condition of the Nisbett and 
Schachter experiment and not in the high- 
fear condition. According to the present in- 
terpretation, the high-fear condition may 
represent the case where several bodily com- 
ponents of fear (€g-, the sinking feeling in 
one’s stomach, the sweating of one’s palms) 
may converge on the hypothesis that shock 
was responsible for the arousal; hence, at- 
tribution to shock might override possible 
alternative explanations manipulated by the 


experimenters. 


'To summarize, 
sion of the causality problem suggests that 


endogenous attribution is likely to be ren- 
dered when (a) no exogenous reasons are 
ned by the attributor, and 


to be entertait 
vidence (e£ of the con- 


or convergence kind) 


then, the present discus- 


likely 
(b) when there is € 
sensus, consistency, 


that corroborates the endogenous explana- 
tion and/or falsifies exogenous explanations 
of the action. 

The identity question. The problem of 
causality, above, has received ample atten- 
tion within attribution theory. By contrast, 
little explicit discussion has been devoted to 
the problem of identity, that is, to the way 
an effect (to be causally explained) is con- 
ceptualized. It is presently suggested that 
the lay person's knowledge of the effect's 
identity is hypothetical in the same sense 
that his knowledge of causality is. In other 
words, an individual confronted with an 
event or a sequence of events is assumed to 
formulate hypotheses about the conceptual 
identity ( meaning) of these phenomena. 
'l'he foregoing implies that the same physi- 
cal events are amenable to several disparate 
identifications. Pertinent to the present con- 
cern, the same action may be conceptualized 
in a variety of different ways, and the spe- 
cific conceptualization that the attributor de- 
cides to adapt will influence the likelihood of 
the action being assigned endogenously or 
exogenously. 

For example, the same sequence of be- 
haviors might be variously interpreted to 
mean that the actor is (a) sweeping the 
floor, (b) practicing his shuffleboard game, 
(c) helping his mother, or (d) earning 
money. Obviously, when the action is con- 
ceptualized in one way an endogenous attri- 
bution thereof needs to assume a quite dif- 
ferent purpose on the part of the actor. 
Thus. if the actor's behavior is concept- 
ualized as helping his mother and its pur- 
pose as earning money, an exogenous 
attribution would be implied, whereas if 
the purpose had been also identified as help- 
ing his mother, the attribution would be 
endogenous. Conversely, conceptualizing 
the action as earning money would lead to an 
endogenous attribution, if the actor's end 
has been similarly identified, whereas if the 
end had been identified as helping his mother, 
the action would appear to be exogenously 


motivated. 

But what may be the conditions for as- 
signing a given identity to an action? The 
assumption that identity attributions are hy- 


pa 
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pothetical suggests the applicability of p 
discounting principle and of the attributiona 
criteria discussed already in reference to a 
problem of causality. For instance, Page 
ing to the discounting rule, the greater the 
number of hypotheses about an effect s (eg. 
an action’s) identity the lower will be the 
attributor’s confidence in each such identity. 
Furthermore, a given identity hypothesis 
would be rendered more plausible the greater 
the amount of corroborative evidence in its 
favor. As in the case of causality, the evi- 
dence regarding identity could be assessed 
by reference to the criteria of consensus, 
consistency, or the convergence of informa- 
tion. In our earlier example, Such criteria 
might more readily suggest the action’s iden- 
tity to be, say, Sweeping the floor, than 
helping the actor’s mother, 
Earlier it has been implied that with an 
action’s constant identity, endogeneity of at- 
tribution will vary with its Perceived caus- 
ality. The present discussion Teverses this 
reasoning to suggest that with a constant 
causality (ie., the action’s Purpose) the en- 
dogeneity of attribution will vary with this 
action's perceived identity, Thus, endoge- 
nous attribution wi 


action is conceptual 
Cie., as the attainment thereof) and exoge- 


terms that are foreign to its goal. 
ingly, endogenous attribution will be more 
likely when (a) no such foreign identity hy- 
potheses will be entertainable and (b) cor- 


The endogeneity question, 
of endogenous-exogenous att 
tified thus far involved mo 
tributor's conception of the 
cause so as to vary the perceived similarit 
between the two. But how May one affect 
the likelihood of an endogenous attribution 
when the identities of both the action and 
the cause are relatively constant and refrac- 
tory to change? In such a case a Category 
of relevant conditions seems implied in the 
relation of endogeneity or CXogeneity that 
may link action and goal, Specifically, the 


The conditions 
ributions iden- 
difying the at- 
action and its 


endogeneity concept connotes the action’s 
inherence to or inseparability from its goal. 
Thus, given a dissimilarly conceptualized 
action and goal, one could promote endoge- 
nous attribution by suggesting that the 
former is an integral part of the latter. For 
example, the young man who believes that 
only by dint of superior performance will he 
achieve the goal of rising in the company's 
ranks may become more endogenously moti- 
vated to his work than his counterpart who 
also contemplates alternative ways of attain- 
ing this objective (eg., having good connec- 
tions, marrying the boss's daughter). 


RECAPITULATION AND CONCLUSIONS 


The central thesis of th 


that the partition between endogenous and 
exogenous reasons is of considerable impor- 
tance in the lay explanation of actions. A 
theoretical framework has been developed 
whereby the action’s endogenous attribution 
implies the actor’s intrinsic motivation and 
subjective freedom, In addition, with knowl- 
edge of the action’s cor 

tribution has been sho 
tion’s specific intenti 
dogenous-exogenous 
Proposed to rep] 
Partition of the 
ternal and exterr 


is paper has been 


The present en. 
distinction has been 
ace the frequent alternative 
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lay assumption, actions are 
» Which is internal. 
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In part) Independent of th 


eworks have been com- 
degrees of Conceptual coher- 
ical Support in the domain of 
action explanation, Tt has been noted that 
3 of the internal-ex- 
y suffer from two con- 

: (a) the problem of cri- 
the problem of inference. 
Problems is that the classi- 
C causes into the internal 

and external Categories has typically lacked 


Justification, and the two criteria 
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that seem to underlie such categorizations 
(the notions of generically internal or ex- 
ternal causes and of the uniqueness stan- 
dard) need not coincide. The second prob- 
lem is that the existing treatments of the 
internal-external distinction furnish little ra- 
tionale for linking it with the various infer- 
ences with which it has been credited. Yet 
numerous examples may be imagined where 
the alleged inferences from internal versus 
external attributions (e.g., as interpreted in 
terms of the uniqueness standard) would 
seem a priori unlikely. By contrast, the 
endogenous-exogenous partition seems clear 
of the problems just outlined. The concep- 
tual criterion for this distinction derives 
from the common differentiation. between 
ends and means, and the inferences implied 
by the endogenous-exogenous partition are 
explicitly deducible therefrom, as shown in 
an earlier section of this paper. 

On the empirical level, it has been noted 
that whenever research findings on actions 
have been explained by reference to the in- 
ternal-external distinction, a reinterpretation 
was possible in terms of the endogenous- 
exogenous distinction. Furthermore, spe- 
cific experimental comparisons yielded con- 
sistent support for the endogenous-exoge- 
nous distinction and failed to support the 
internal-external one. 

The present article suggests several pos- 
sible avenues for further research. To sam- 
ple just a few: 

1. Additional data seem necessary to dem- 
onstrate that internal attributions (unique- 
ness, self-selection, etc.) may not provide 
the explanatory implications with which they 
have been credited in the numerous areas of 
attributional research. 

2. The broad claims connected with the 
endogenous-exogenous partition (of action's 
; merit a careful empirical study. 
e, the contention that endogenous 
sufficient 


reasons) 
Tor instance 
attribution. is 
condition of perce 


the necessary and 
ived freedom needs to he 
examined against. the recently growing nt 
erature on the freedom notion. Similarly, 
the suggestion that inferences about > 
tional behaviors may be explicable by refer 
ence to the endogenous-exogenous distinc- 
tion needs empirical substantiation. 


3. Finally, the presently proposed condi- 
tions of endogenous versus exogenous attri- 
bution require a close investigation. It is 
by thorough understanding of such condi- 
tions that the theory of endogenous attribu- 
tion may find its application in concrete 
situations of practical significance. 
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This paper presents a spreading-activation theory of human semantic pro- 
cessing, which can be applied to a wide range of recent experimental re- 
sults. The theory is based on Quillian's theory of semantic memory search 


and semantic preparation, or priming. 
the misconceptions concerning Qullian’s theory are discussed. 


In conjunction with this, several of 
A number 


of additional assumptions are proposed for his theory in order to apply it 


to recent experiments. 


The present paper shows how the extended theory 


can account for results of several production experiments by Loftus, Juola 
and Atkinson's multiple-category experiment, Conrad's sentence-verification 
experiments, and several categorization experiments on the effect of semantic 
relatedness and typicality by Holyoak and Glass, Rips, Shoben, and Smith, 
and Rosch. 'The paper also provides a critique of the Smith, Shoben, and 
Rips model for categorization judgments. 


Some years ago, Quillian? (1962, 1967) 
proposed a spreading-activation theory of 
human semantic processing that he tried to 
implement in computer simulations of mem- 
ory search (Quillian, 1966) and com- 
prehension (Quillian, 1969). The theory 
viewed memory search as activation spread- 
ing from two or more concept nodes in a 
semantic network until an intersection was 
found. The effects of preparation (or prim- 
ing) in semantic memory were also ex- 
plained in terms of spreading activation from 
the node of the primed concept. Rather 
than a theory to explain data, it was a the- 
ory designed to show how to build human 
semantic structure and processing into a 


computer. 
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ý Quillian’s theory of priming appeared in the 
unpublished version of th 
70, Carnegie 


he 1967 paper (ie, CIP 
Paper No. Institute of Technology, 
1965). 


Since the theory was proposed, there have 
been a number of experiments investigating 
retrieval and priming in semantic memory. 
In the present paper, we attempt to show 
how an elaboration of Quillian's basic theory 
can account for many of the results. In the 
first section, we briefly review the original 
theory while trying to correct a number of 
the common misunderstandings concerning 
it. In the second section, we extend the 
theory in several respects, and in the third 
section show how the extended theory deals 
with some recent experimental findings. In 
the fourth section we compare the theory to 
the model of Smith, Shoben, and Rips 


(1974). 


QUILLIAN'S THEORY OF SEMANTIC MEMORY 


The fact that Quillian’s theory was devel- 
oped as a program for a digital computer 
imposed certain constraints on the theory, 
which Quillian felt were psychologically un- 
realistic. We will recount the theory as he 
proposed it, and then elaborate the theory 
in psychological terms. The theory made 
a number of assumptions about structure and 
processing in human semantic memory. A 
brief discussion of these assumptions follows. 

People's concepts contain indefinitely large 
amounts of information. Quillian used the 
example of a machine. If one asks people 
to tell everything they know about machines, 
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they will start off giving obvious ges 
for example, that machines are man-made 
and have moving parts. But soon people 
Tun out of obvious facts and begin giving 
facts that are less and less relevant, for ex- 
ample, that a typewriter isa machine Or even 
that the keys on the IBM electric type- 
writers select the position of a ball that 
strikes the ribbon against the paper. 'The 
amount of information a person can gen- 
erate about any concept in this Way seems 
unlimited, 

In these terms concepts correspond to par- 
ticular senses of words or phrases, For ex- 
ample, not only is the noun , machine’ a 
concept, but the verb “to machine" is a con- 
cept; the "particular old car I ow is a 
concept; the notion of "driving a car" is a 
concept; even the notion of "what to do if 
you see a red light” has to be a concept. 
Thus, people must have a very large num- 
ber of concepts, and concepts must have very 
complicated structures, 


the concept 


al links from 
the node to other concept nodes. These 


Links can 
» Which are num- 
each link is to 
The criterialj- 


criterial for the concept of 


kind is a typewriter, From each of the 
nodes linked to a given node, there wil] be 
links to other concept nodes and from each 
of these in turn to still others, In Quil- 


lian’s theory, the full Meaning of any con- 
ntered from 


cept is the whole network as e 
the concept node, 

The links are not simply 
links, but must be complic 
Tepresent any relation between two Concepts. 
In the original theory, Ouillian Proposed five 
different kinds of links: (a) Superordinate 
("isa") and subordinate links, (b) modifier 
links, (c) disjunctive sets of links, (d) con. 


undifferentiated 
ated enough to 


junctive sets of links, and (e) a — 
class of links, which allowed the specifica 
ton of any relationship where. the falsae 
ship (usually a verb relationship ) itself wW 2 
a concept. These different kinds of links 
could be nested or embedded to any degree 
of depth, so that. the format was designed to 
be flexible enough to express anything, how- 
ever vague or specific, that can be expressed 
in natural language. 

The search in memory between concepts 
involves tracing out in parallel (simulated 
in the computer by a breadth-first search) 
along the links from the node of each cone 
cept specified by the input words. _ The 
words might be part of a sentence or stimuli 
in an experimenta] task, The spread of ac- 
tivation constantly expands, first to all the 
nodes linked to the first node, then to all 
the nodes linked to each of these nodes, and 
So On. At each node reached in this pro- 
cess, an activation tag is left that Specifies 
the starting node and the immediate prede- 
cessor, When a tag from another starting 
node is encountered, an intersection between 
the two nodes has been found.? By follow- 
ing the tags back to both starting nodes, the 


path that led to the intersection can be 
reconstructed, 


When an inte 


Tsection has heen found, it is 
necessary to ey 


aluate the path to decide if it 
Satisfies the constraints imposed by syntax 
and context, The complicated kinds of de- 
cision rules that are invoked for comprehen- 
sion of Sentences in this evaluation phase 
are described in Quillian (1969). For cate- 


Sorization fasks these tules are described 
by Collins and Quillian (1972b) and in the 
NeXt section of this Paper, 

m a phrase such as “the fall 
found betwee: 


As an example, 
leaves,” a path 
n the concept “to fall” and the 


- OE 
? There can be intersections wi 
Starting nodes, hut we hav 
mn this paper to 
Quillian, 1966, 
in Quillian’s 


aese were discussed by 
"n regard to comprehension, but 
Sidered in this paper, only 
starting nodes needs to be con- 
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concept "tree leaf" would be rejected as a 
wrong interpretation, because syntax re- 
quires a participial form of “fall” to fit that 
interpretation. If a path found is rejected, 
other paths are considered in the order in 
which they are found. 

Priming (or preparation) involves the 
same tracing process that was described for 
memory search. When a concept is primed, 
activation tags are spread by tracing an ex- 
panding set of links in the network out to 
some unspecified depth. When another con- 
cept is subsequently presented, it has to 
make contact with one of the tags left earlier 
to find an intersection. One of the non- 
obvious implications of this view of priming 
is that links as well as nodes will be primed. 
This is because Quillian treated links them- 
selves as concepts (see above). Thus prim- 
ing a node such as "red" will prime the links 
involving the relation “color” throughout the 
network. This provides a very powerful 
context mechanism. 


Common Misinterpretations of 
Quillian’s Theory 

There is a rich variety of misinterpreta- 
tions of Quillian’s theory, many of them de- 
riving from Collins’s (Collins & Quillian, 
1969, 1970a, 1970b) simplifications of the 
theory. The problem arose because Collins 
and Quillian were investigating specific as- 
pects of the theory and only described 
enough of the theory to motivate a particu- 
lar experiment. In turn experimenters made 
interpretations of these simplified versions, 
which did not fit with the theory as de- 
scribed elsewhere (Bell & Quillian, 1971; 
Quillian, 1966, 1969). 

Perhaps the most prevalent misinterpreta- 
f Quillian's theory concerns the idea 
of cognitive economy (Anderson & Bower, 
1973; Conrad, 1972). In this regard, it is 
important to distinguish the strong theory of 
cognitive economy, which Conrad takes issue 
with in her attack on Collins and Quillian 
(1969), and the weak theory of cognitive 
economy, which Collins and Quillian were 
testing (though they did not spell it out 
clearly enough). As Conrad (1972) sae 
she rejects the “hypothesis that all proper 


tion 0 


ties are stored only once in memory and 
must be retrieved through a series of infer- 
ences for all words except those that they 
most directly define" (p. 153). This is a 
statement of the strong theory of cognitive 
economy, Undoubtedly the Collins and 
Quillian (1969) paper gave rise to this no- 
tion, but the authors cautioned against mak- 
ing that interpretation of the theory. As 
they said, “people surely store certain prop- 
erties at more than one level in the hier- 
archy" (p. 242), and they cited the maple 
leaf as an example of this general rule. 

The strong theory requires erasing infor- 
mation whenever it applies at a more gen- 
eral level. If a person learns a robin can 
fly and then later that birds fly, the strong 
theory implies that “flying” must be erased 
from “robin.” The weak theory of cogni- 
tive economy merely assumes that every 
time one learns that X is a bird, one does not 
at that time store all the properties of birds 
with X in memory. Thus, an inference will 
be necessary to decide that X can fly, unless 
one encounters this fact directly, Hence, 
Collins and Quillian, in testing the weak 


theory of cognitive economy, picked in- 


stances where people were not likely to have 
encountered the general property with the 
specific instance (e.g., “A wren can fiy”). 
The point of the experiment was to test 
whether it was possible to measure infer- 
ence time, when the weak theory of cogni- 
tive economy implies that an inference is 
likely to be necessary for most subjects. 
Another assumption sometimes made about 
Quillian’s theory is that all links are equal 
(Anderson & McGaw, 1973; Rips, Shoben, 
& Smith, 1973; Wilkins, Note 1). In Quil- 
lian's original theory, there were criteriality 
tags on links, as we described earlier. In 
Collins and Quillian (1969, 1972b) links 
were assumed to have differential accessi- 
bility (ie. strength or travel time). The 
accessibility of a property depends on how 
often a person thinks about or uses a prop- 
erty of a concept. Whether criteriality and 
accessibility are treated as the same or dit- 
ferent is a complex issue, but network mod- 
els allow them to be treated either way. 
Thus for example, even though “lungs,” 
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“hands,” and “warts” are all linked directly 
to the concept "human," these links need 
not be in any sense equal. » The same is 
true for the links between." bird and its 
exemplars, such as "robin," chicken," or 
"penguin." Rips et al. (1973) suggest that 
intermediate nodes are necessary for a net- 
work model to explain the reaction time dif- 
ferences they find in categorizing different 
birds. This makes the mistaken assumption 
that all links are equally criterial or accessi- 
ble in any network model. It turns out, 
however, that differences in links are crucial 
to many different aspects of human seman- 
tic processing as Carbonell and Collins 
(1973) point out in their discussion of im- 
portance (or criteriality) tags. 

A related implication of the Rips et al. 
(1973) paper and also a more recent paper 
of Smith et al. (1974) is that feature models 
can account for data that network models 
cannot. A feature model posits that a con- 
cept consists of a set of values on a large 
number of semantic dimensions ( eg., ani- 
mateness, color, etc.). What is strange 
about this argument is that network models 
were developed as a method of representing 
features in a computer. Any process that 
can be represented in a feature model is 
representable in a network model; in par- 
ticular, the Smith et al. 


model itself could 
be implemented in a semantic network (Hol- 
lan, 1975). In fact, network models are 
probably more powerful than feature mod- 
els, because it is not obvious how to handle 
inferential processing 


or embedding in fea. 
ture models, 


Smith et al. (1974) argued in favor of 
feature models because their data for com- 
parison of concepts seemed to fit 
comparison process, 
phasized about Quilli 
parallel search would 


a feature 
What should be em- 


an’s theory is that the 


inevitably lead to just 
such a feature comparison Process, though 


the process would take place over a period 
of time as different connections are found, 
One way that Quillian’s theory is different 
from the Smith et al. models is that super- 
ordinate connections, if they exist, would 
also be found and evaluated, The distinc- 


tion between these two theories is so crucial 
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that we will discuss it at length in conjunc- 
tion with the spreading activation theory's 
explanation of the Rips et al. (1973) results. 

Another common misconception of Quil- 
lian’s theory shows up in Juola and Atkin- 
son’s (1971) work on categorization judg- 
ments. In a categorization task, response 
time is measured for a subject to decide 
whether or not a particular instance (e.g. 
"car") is a member of one or more cate- 
gories (e.g., "flower" or "vehicle"). Juola 
and Atkinson assume that in Quillian's the- 
ory the memory search to make a categoriza- 
tion judgment proceeds from the instance 
to the category. In fact, the wording m 
Collins and Quillian (1969) mistakenly gives 
that impression, But Quillian's theory 
(1966, 1969) assumes the search proceeds 
from both the instance and category in par- 
allel. However, if one or the other is pre- 
sented first, this gives the search from that 
node a head Start, which is the notion of 
priming. Juola and Atkinson's experiment 
involves priming in a complicated way, 
Which we will discuss below. 

Anderson and Bower (1973) reject a 
Quillian-like model of a parallel search, 
while acknowledging that their data are 


compatible with “a parallel model whose 
search rate is slower in proportion to the 
number of paths that must be searched” (p. 


371). Anderson and Bower’s argument 
implies wrongly that Quillian has made the 
independence assumption for his parallel 
Search. An independent parallel search is 
like a race where the speed of each runner 
15 independent of the other runners, This 
1$ à common assumption in psychology, be- 
Cause it makes it possible to assign an upper 
bound to Teaction time (see Sternberg, 
1966), But there is no difficulty for Quil- 
lian's theory if the parallel search rate de- 
pends on the number of paths searched. 

ence, Anderson and Bower’s data are 


Perfectly compatible with Quillian's parallel 
search, 


The above discussion, 
Quillian’s the 
what it is n 
tions are 


(19721), 


then, shows what 
Ory is not, or at least some of 
ot. Several other misconcep- 
discussed in Collins and Quillian 
in particular the notion that Quil- 


"y 
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lian's theory of memory is rigidly hierarchi- 
cal, which Anderson and Bower (1973, p. 
379) still believe, and Schaeffer and Wal- 
lace's (1970) argument that Quillian's the- 
ory predicts it will always take less time to 
compare concepts that are close together in 
the semantic network. We will return to 
some of these same papers below, in order 
to describe how the extended version of 
Quillian’s theory accounts for some of the 
results these experimenters have used to 
reject Quillian’s theory. 


THE EXTENDED THEORY 


In order to deal with the specific experi- 
mental results that have appeared in recent 
years, several more processing and struc- 
tural assumptions must be added to the basic 
Quillian theory. These do not bend the 
theory, but merely elaborate it in such a 
way that it can be applied to the kinds of 
experiments on semantic memory that have 
been performed recently. The elaboration 
may itself be wrong, so our mistakes should 
not be held against Quillian's theory. 


Local Processing Assumptions 


There are four local processing assump- 
tions in the extended theory. These four 
assumptions transform the theory from com- 
puter terms to quasi-neurological terms, a la 
Pavlov. But all the assumptions of the origi- 
nal theory should be preserved despite the 
transformation, except that activation tags 
are to be considered as source-specific acti- 
vation (i.e., activation that is traceable to 
its node of origin). 

1. When a concept is processed (or stim- 
ulated), activation spreads out along the 
paths of the network in a decreasing gradi- 
ent. The decrease is inversely proportional 
to the accessibility or strength of the links 
in the path. Thus, activation is like a sig- 
nal from a source that is attenuated as It 
travels outward. N . 

2. The longer à concept is continuously 
processed (either by reading, hearing, OF 
rehearsing it), the longer activation 1s 1e 
leased from the node of the — at a 
fixed rate. Only one concept can z = 
tively processed at a time, which is a limita 


tion imposed by the serial nature of the 
human central process (Collins & Quillian, 
1972b). This means that activation. can 
only start out at one node at a time. But 
it continues in parallel from other nodes 
that are encountered as it spreads out from 
the node of origin. 

3. Activation decreases over time and/or 
intervening activity. This is a noncommittal 
assumption that activation goes away gradu- 
ally by some mechanism. Assumptions 2 
and 3 impose a limitation on the amount of 
activation that can be allocated in priming 
more than one concept, because the more 
concepts that are primed, the less each will 
be primed. 

4. With the assumption that activation is 
a variable quantity, the notion of intersection 
requires a threshold for firing. The assump- 
tion is that activation from different sources 
summates and that when the summation at 
the point of intersection reaches threshold, 
the path in the network producing the in- 
tersection will be evaluated. 


Global Assumptions About Memory 
Structure and Processing 


There are three assumptions in the ex- 
tended theory concerned with the global 
structure of memory and its processing. 
These are generalizations of Loftus's (Note 
2) arguments that semantic memory is or- 
ganized primarily into noun categories and 
that there is a “dictionary” (or lexical 
memory) separate from the conceptual net- 
work. 

5. The conceptual (semantic) network is 
organized along the lines of semantic simi- 
larity. The more properties two concepts 
have in common, the more links there are 
between the two nodes via these properties 
and the more closely related are the con- 
cepts. This means that different vehicles 
or different colors will all be highly inter- 
linked through their common properties. 
This also implies that red things (e.g. fire 
engines, cherries, sunsets, and roses) are 
not closely interlinked, despite the one prop- 
erty they have in common. In these terms 
semantic relatedness is based on an aggre- 


ed 


x 
AMBULANCE) 


Ficung 1. A 
typical fragment 
relatedness), 


schematic Tepresentation of Concept relatedness in à stereo- 
of human memory (where a shorter line Tepresents greater 


gate of the 


interconnections 
concepts,* 


between two 


gate of all the paths, 
in distance, say by a path through "red," and still 
not be closely related because that is 
path. Our use of close to refer to both 
ships is admittedly confusing. In this Paper we 
shall use close to refer to relatedness or similarity, 
though in some tasks (Quillian, 1966) it i. only 
distance that matters. 


relation- 
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Figure 1 illustrates this aggregate notion 
of Concept relatedness for a hypothetical 
human memory. (It is the kind of diagram 
that the scaling techniques of Rips et al., 
1973, would produce.) 
various vehicles 
lated, because of the nu 


sumed to exist be- 

versely, the Concepts asso- 
ciated with “red” are shown as less related, 
Tesumed paucity of inter- 
een them, "—— 
mption that memory is Or- 
8 to semantic similarity, to- 
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gether with earlier assumptions, it follows 
that if “vehicle” is primed, activation at any 
type of vehicle will accumulate from many 
neighboring nodes. That is to say, to the 
degree that “fire engine” is primed by “ve- 
hicle," it will in turn prime “ambulance,” 
“truck,” “bus,” etc., and each of these in 
turn will prime the others. On the other 
hand, if "red" is primed, the activation that 
spreads to "fire engine" will not prime 
“cherries,” "roses," or “sunsets” to any 
great extent, because there are so few con- 
nections between these concepts. Instead, 
“fire engine" will tend to prime other ve- 
hicles, and “cherries” to prime other fruits. 
Hence, the same amount of activation will 
be diffused among a greater number of con- 
cepts. 

6. The names of concepts are stored ina 
lexical network (or dictionary) that is or- 
ganized along lines of phonemic (and to 
some degree orthographic) similarity. The 
links from each node in the lexical network 
are the phonemic properties of the name, 
specified with respect to their position in the 
word. The properties stored about names 
are assumed to be the properties that Brown 
and McNeill (1966) found people could 
identify about words on the “tip of their 
tongue.” Each name node in the lexical 
network is connected to one or more concept 
nodes in the semantic network. 

7. Loftus’s (Note 2) data lead to the fur- 
ther assumption that a person can control 
whether he primes the lexical network, the 
semantic network, or both. For example, a 
person can control whether to prime (a) 
words in the lexical network that sound like 
“pird,” (b) concepts in the semantic net- 
work related to "bird," or (c) words in the 
lexical network corresponding to the con- 
cepts in (b). This control over priming 
can be thought of in terms of summation of 
diffuse activation for an entire network (per 
haps in a particular part of the s ane 
source-specific activation release ment © 

sou. node DMB; (a) would derive 

particular no : ae c 

+ ation of the lexical network to 

from activation “pind.” (b) would de- 

gether with the word "bird," (b). rk 

P tonti f the semantic network 
rive from activation 0 


together with the concept “pird,” and (c) 
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would derive from activation. of both net- 
works together with the concept “bird.” 


Assumptions About Semantic 
Matching Process 


‘There are a number of assumptions about 
the decision process for evaluating whether 
or not two concepts match semantically. 
This is a fundamental process that occurs 
in many aspects of language processing, 
such as matching referents, assigning cases, 
and answering questions (Collins & Quil- 
lian, 1972b; Collins, Warnock, Aiello, & 
Miller, 1975). Categorization tasks, which 
ask “Is X a Y?” (where X and Y are con- 
cepts), directly investigate this process. The 
decision process described here is a more ex- 
plicit and somewhat revised version of the 
process postulated by Collins and Quillian 
(1972b), with additions to encompass the 
results of Holyoak and Glass (1975). 

&. In order to decide whether or not a 
concept matches another concept, enough 
evidence must be collected to exceed either 
a positive or a negative criterion. The evi- 
dence consists of various kinds of intersec- 
tions that are found during the memory 
search. Evidence from different paths in 
memory sum together. Positive and nega- 
tive evidence act to cancel each other out, as 
shown by dialogue excerpts in Carbonell and 
Collins (1973). Failure to reach either cri- 
terion before running out of relevant evi- 
dence leads to a “don’t know' response 
(Collins et al., 1975). This process is es- 
sentially the Bayesian decision model that is 
common in the reaction time literature (see, 
for example, Fitts, 1966; Stone, 1960). 

There are a number of different kinds of 
paths between the two concepts that consti- 
tute positive or negative evidence. Any of 
these types of evidence might contribute to 
a particular decision. The different types 
are listed in Table 1 and described below in 
Assumptions 9-13. 

9. If the memory search finds that there 
is a superordinate (or a negative superordi- 
nate) connection from X to Y, that fact 
alone can push the decision over the posi- 
tive (or negative) criterion. Superordinate 


links act like highly criterial property links 


E 


= ee Wer 
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TABLE 1 


Types or Parus Founp iN MEMORY 
THAT CONSTITUTE POSITIVE OR 
NEGATIVE EVIDENCE 


Positive evidence Negative evidence 


i M ive superordinate 

Superordinate connection Negative sup ! 
Property comparison, 

distinguishing property 
Wittgenstein strategy, 

distinguishing property 
Mutually exclusive 

subordinates 
Counterexamples 


Property comparison, 
matching property 
Wittgenstein strategy, 
matching property 


(see below). For example, it is conclusive 
positive evidence that a mallard is a bird, if 
superordinate links are found between “mal- 
lard" and "duck" and between "duck" and 
"bird." Similarly, if a negative superordi- 
nate link is found between “bat” and "bird," 
it is conclusive evidence that a bat is not 
a bird. 

10. If the memory search finds properties 
on which X and Y match (i.e, common 
properties), this is positive evidence propor- 
tional to the criteriality of the property for 
Y. If the memory search finds properties 
on which X and Y mismatch (i.e., distin- 
guishing properties), this is negative evi- 
dence proportional to the criteriality of the 
property for Y. There is an asymmetry in 
the weighing of positive and negative evi- 


dence in a Property comparison, because a 
mismatch on just one fairl 


erty can lead to 


memory search finds them, 
guishing properties make it ha 
the positive criterion when the 
ordinate connection and theref 
the process, 

As an example of property 
suppose there is no superordinat 
in a particular person's mem 
"mink" and 


“ 


Thus, distin- 
rder to reach 
re is a Super- 
ore slow down 


comparison, 
€ connection 
Ory between 
and between 
Then the deci- 
T Cats are farm 


“farm animal" 
cat" and "farm animal.” 
sion as to whether minks o 
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animals might be based on a comparison of 
the properties of minks or cats on one hand, 
and farm animals on the other. The most 
criterial properties of farm animals are pre- 
sumably being animate and being kept on 
farms, but other less criterial properties 1n- 
clude being domesticated, being raised for 
some purpose, or being kept in barns or 
outside. How a particular person would 
weigh the various properties of minks and 
cats to decide whether they are farm ani- 
mals would vary from person to person (our 
intuition is that minks are farm animals and 
cats are not, even though both have the two 
most criterial properties of farm animals— 
what Smith et al, 1974, called defining 
properties). This decision Strategy is simi- 
lar to that proposed by Smith et al., as we 
will discuss later, 

11. The Wittgenstein Strategy is a variant 
of the property comparison strategy. It is 
postulated on the basis of Wittgenstein's 
(1953) observation that to decide whether 
something is a game (for example, frisbee), 
a person compares it to similar instances that 
are known to be games, Our assumption is 
that if any Properties of X are found that 
match properties of ano 
Superordinate is Y, these constitute posi- 
tive evidence. Similarly, any distinguishing 
Properties constitute negative evidence. In 
the Wittgenstein Strategy, unlike the prop- 
erty comparison Strategy, matching proper- 
ties count just as much toward a positive 
decision as distinguishing properties count 
toward a negative decision. 

To illustrate the Wittgenstein strategy, 
Collins and Quillian (1972b) pointed out 
that in deciding whether a stagecoach is a 
vehicle, it might be compared to a car. The 
many properties that a Stagecoach has in 
T constitute strong posi- 
a stagecoach is a vehicle. 
Stagecoach does not have 
highly criteria] for being 
s is strong evidence that 
t a car, it is only weak 
hot a vehicle. This illus- 
me evidence is weighed 
Toperty comparison strat- 
ttgenstein strategy. The 


ther instance whose 


& motor, which is 
a car. Though thi 
a stagecoach is no 
evidence that it is 1 
trates how the sa 
differently in the p 
esy and the Wi 
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final decision that a stagecoach is a vehicle 
might depend both on matching properties 
between a stagecoach and vehicles in gen- 
eral (conveyance, motion, etc.) and match- 
ing properties between a stagecoach and par- 
ticular vehicles like a car (seats, doors, etc.). 
Thus the property comparison strategy and 
the Wittgenstein strategy might combine to 
determine a person's response. 

12. The mutually exclusive subordinates 
strategy was necessary for programming a 
computer to answer questions (Collins et al., 
1975). Holyoak and Glass (1975) argue 
that this strategy accounts for some of their 
reaction time data (they call it a contradic- 
tion). The assumption is that if two con- 
cepts have a common superordinate with 
mutually exclusive links into the common 
superordinate, then this constitutes strong 
negative evidence, almost comparable to a 
negative superordinate link. 

For example, if the question is whether a 
mallard is an eagle, the fact that a mallard 
is a duck and ducks and eagles are mutually 
exclusive kinds of birds is rather conclusive 
evidence that a mallard is not an eagle. 
Though Holyoak and Glass (1975) do not 
mention it, the mutually exclusive restriction 
is necessary. For example, the fact that 
Mike Mansfield is a politician does not ex- 
clude him from being a lawyer. Although 
“politician” and “lawyer” are both occupa- 
tional roles, they are not mutually exclusive 
and in fact most politicians are lawyers. 
But lacking specific information to the con- 
trary, people may make a default assump- 
tion of mutual exclusivity when two con- 
cepts have a common superordinate. 

13. Counterexamples also can be used as 
negative evidence. This strategy derives 
from Holyoak and Glass (1975), who argue 
that statements of the kind “All birds are 
disconfirmed by finding a 
such as “robin.” If the 


« ; 

question is of the form fis Sa TP = 
. d e rus 

there is à superordinate link from Y to e 


inv nd- 
then finding 2 counterexample inv olves fir 
Xas superordinate and 


y. This is con- 
F 
X is not always à X. 


canaries" are 
counterexample, 
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Holyoak and Glass (1975) discusss coun- 
terexamples in the context of the universal 
quantifier "all," but the same process would 
occur for a question of the kind “Is a mar- 
supial a kangaroo?” In such a case, Te- 
trieving a counterexample (such as à wall- 
aby) can be used to determine that kanga- 
roos are a subset of marsupials and not 
equivalent (e.g. "automobiles" and "cars" 
are equivalent concepts). 

Though these five kinds of evidence (As- 
sumptions 9-13) are the only ones we have 
postulated for the semantic matching pro- 
cess seen in categorization tasks, there may 
be other kinds of evidence of this sort. We 
should stress that there are many other 
kinds of evidence people use for answering 
more complicated questions (Collins et al., 
1975). It is beyond the scope of this paper, 
however, to consider all the different ways 
people use evidence to make semantic deci- 
sions. 


Recent EXPERIMENTS 


In this section, we discuss how the theory 
deals with some different kinds of recent 
experiments. The four types of studies to 
which we apply the theory are (a) several 
production experiments by Loftus (Freed- 
man & Loftus, 1971; Loftus, 1973a, 1973b, 
Note 2) ; (b) Juola and Atkinson's (1971) 
multiple-category experiment; (c) the Con- 
rad (1972) sentence-verification experiment ; 
and (d) several categorization experiments 
on the effects of semantic relatedness and 
typicality (Holyoak & Glass, 1975; Rips et 
al., 1973; Rosch, 1973; Smith et al., 1974). 
We intend to deal with the major kinds of 
available findings to which the Quillian the- 
ory has not yet been applied. Our objective 
is to show how a spreading-activation theory 
can handle these results, not to consider all 
the possible alternative explanations of the 
experiments. 


Production Experiments of Loftus 

There are several Loftus experiments we 
want to discuss in terms of the spreading- 
activation theory. The first of these is an 
experiment by Freedman and Loftus (1971), 
in which subjects had to produce an in- 
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stance of a category that began with mere? 
letter or was characterized by a perd E 
jective. For example, subjects pos Es 
ked to name a fruit that begins wi 
Des A or a fruit that is red. On "es 
trials the category was shown first an 0 
some trials second. Hence, this was a prim- 
ing experiment in that one concept was m 
tivated before the other. Reaction time w m: 
measured from the onset of the secon 
M enr is with the finding that sub- 
jects were faster when the category (eg., 
"fruit") was given first than when either 
the letter or the adjective was given first. 
This basic result was later replicated even 
for cases in which the instance named was 
a more frequent associate to the adjective 
than to the category noun: (e.g., lemon ' is 
a closer associate of “sour” than of fruit”). 
The explanation in terms of the theory 
is as follows: When a noun, such as “fruit, 
is presented first, the activation spreads to 
nodes connected to “fruit, among which are 
instances such as “apple,” pear,” “peach, 
“orange,” and “lemon.” But these concepts 
are all highly interlinked with each other 
(though some, such as "orange" and 
"lemon," are more closely interlinked than 
others). Thus, the total amount of activa- 
tion is spread among a relatively small num- 
ber of closely interlinked concepts (see As. 
sumption 5). However, when an adjective 
Or letter is Presented first, say "red" or 
“A,” the activation Spreads to a much wider 
Set of concepts, Which are not particularly 
interlinked with 


large variety of different things tl 


accumulation of 
these are Closer 
So that it takes 
Ss time, to trig- 


hen the Second stimulus 
is presented, 


Freedman and Loftus (1971) 
their finding in terms of entering the cate- 


gory when a noun is presented and entering 
a cluster within the category when the adjec- 


explained 
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tive or letter is presented. Thus if the 
noun is presented first, the subject can pes 
the category immediately and need only 
choose the correct cluster when the adjective 
or letter is presented. But if the adjective 
or letter is presented first, the subject minst 
wait until the category is presented, because 
the cluster is specific to that category. (How- 
ever, Loftus, Note 2, has revised this expla- 
nation for the letter stimulus in her dic- 
tionary-network model.) aa 

The Freedman and Loftus explanation 1s 
not altogether different from the explanation 
offered here, though our theory is less rigidly 
hierarchical, The rigid hierarchy gets into 
trouble with errors Such as one we encoun- 
tered where a subject produced “Ben Frank- 
lin," given the stimulus pair "president" 
and “F,” although he later recognized his 
mistake, In an activation theory, “Frank- 
lin" is a Very likely intersection starting at 
"president" and “F,” because he is so closely 
linked with the concept, “president,” and 
Some of its foremost instances, such as 
“Washington.” Such a Wrong intersection 
Was likely in this case because the correct 
answer (prior to Ford) was “Fillmore,” 
who is rather inaccessible and unlikely to be 
found quickly enough to preclude finding 


“Franklin.” “Once such an intersection is 


found, it is only by evaluating the connec- 


tion between “president” and "Franklin," 
that it can pe rejected (see Assumptions 
8-13) i 


tis a genera] problem of category- 
Search models that they cannot deal with 
such errors, 
Perhaps the major 
Spreading-activation theo 
man and Loftus (1971) 
tying their result t 
a quite different 
(19736). Tn a cat 
Loftus found that th 
ciation betw, 


determined Whether Subjects were faster 
When given the Category first or the instance 

In the experiment she used four kinds 
of Category-instance pairs: (a) pairs where 
both the Category and instance evoked the 
Other with high frequency (eg. "tree- 
oak”) : (b) pairs where the category evoked 


advantage of the 
ry over the Freed- 
explanation is in 
O a parallel result in 
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the instance with high frequency, but the in- 
stance evoked the category with low fre- 
quency (e.g. *seafood-shrimp") ; (c) pairs 
where the category evoked the instance with 
low frequency, but the instance evoked the 
category with high frequency (e.g., "insect- 
butterfly"); and (d) pairs where both the 
category and instance evoked the other with 
low frequency (e.g. *"cloth-orlon"). When 
the category was presented before the in- 
stance, reaction time for Conditions (a) and 
(b) was approximately equal and signifi- 
cantly faster than for Conditions (c) and 
(d). However, when the instance was pre- 
sented first, reaction time for Conditions 
(a) and (c) was approximately equal and 
significantly faster than for Conditions (b) 
and (d). That is to say, subjects are fast 
when the category is presented first, if the 
category evokes the instance with high fre- 
quency, and subjects are fast when the in- 
stance is presented first, if the instance 
evokes the category with high frequency. 
'The spreading-activation theory explains the 
pattern of reaction times in the following 
way, assuming that production frequency is 
a measure of the strength or accessibility of 
the path from one concept to another. When 
the first concept (ie, the one presented 
first) evokes the second with a relatively 
high frequency, this means that more activa- 
tion spreads to the second, and it takes less 
time to reach the threshold for an intersec- 
tion. Thus, the amount the first concept 
primes the second concept determines the 
reaction time. 

By comparing this experiment with the 
Freedman and Loftus (1971) study, it can 
be seen that the two results are exactly 
Based on our structural assump- 


parallel. a t 
tions, “fruit? primes apple” more than 
“ed” or the letter “A” primes "apple" in 


an and Loftus study. wr 
- reaction time occurs when “fruit” 
Bern presetted frst, Similarly in ie 
Loftus (1973b) study, when (^ — 
primes the instance most highly, e a » 
reaction times occur when the categ d 
-esented first. But when the ins ° 
os most highly, the short 
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is presented first. A spreading-activation 
explanation is quite compelling to account 
for the Loftus (1973b) results, and the 
theory offered here encompasses the order 
effect in both the Loftus study and the ear- 
lier Freedman and Loftus experiment within 
a single framework. 

Recently Loftus (Note 2) has found two 
different ways in which presenting a letter 
acts differently from presenting an adjective, 
in variations of the Freedman and Loftus 
paradigm. This has led her to the develop- 
ment of a dictionary-network model, which 
we will translate into spreading-activation 
terms. The first difference between pre- 
senting a letter and an adjective appeared 
when Grober and Loftus (1974) compared 
reaction time in two conditions: one where 
noun-adjective (e.g., *fruit-red") and noun- 
letter (eg, "fruit-A") trial] were ran- 
domly intermixed, and one where noun- 
adjective and noun-letter trials were sepa- 
rated into blocks. In all cases the noun 
preceded the adj ective or letter. The results 
of this experiment are shown in Figure 2. 
It is clear that when the subject knows a 
letter is coming, he can prepare forit, But 
in the mixed condition, the subject appar- 
ently prepares for either kind of trial the 
same way he prepares for an adjective trial, 
since adjective trials take the same amount 
of time in either case. The theory's descrip- 
tion of semantic processing on the adjective 
trials is the same as that given earlier for 
the Freedman and Loftus (1971) experi- 
ment, with the amendment that only the se- 
mantic network and not the lexical network 
would be diffusely primed before the ad- 
jective is presented. When an intersection 
is found in the semantic network, then the 
subject must retrieve the name from the 
lexical network. 

Loftus (Note 2) described what must 
happen on noun-letter trials in the blocked 
condition as follows: 

The first step of the process is entering the cate- 
The next step is à quasi-parallel simul- 
towards the Dictionary. That is 

bject traces some number of path- 
ways leading from category instances to the n. 
tionary representations of those instances. = 
step can be started during the interval between the 


gory. 
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to say, the su 
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This is essentially the spreading-activation 
explanation, if the dictionary is taken to be 
a lexical network. Rat 


“the subject traces some number of path- 
ways,” which suggest 


The differ- 
3 for the noun-letter trials 


depends on 
ct primes both networks 


may be either because ofa Principle of least 
effort (hence he could speed up his reac. 


tion time if he tried) or because there is Jess 
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activation available to the semantic network 
if both are primed (hence he will be n 
on noun-adjective trials if he primes bot! P 
As can be seen in Figure 2, the subject is 
much slower on noun-adjective trials than 
on noun-letter trials in the blocked condi- 
tion. This is accounted for by the fact that 
an intersection. on a noun-adjective trial 
occurs in the semantic network and requires 
the further step of retrieving the correspond- 
ing name in the lexical network, On the 
other hand, the intersection on a noun-letter 
trial occurs at the name in the lexical net- 
work, Therefore, the name does not then 
need to be retrieved. , 
The second result that shows up the dif- 
ference between adjectives and letters was 
predicted by Loftus from the dictionary-net- 
work model. In this experiment (Loftus & 
Cole, 1974) Subjects saw three stimuli, or- 
dered either noun, adjective, letter or noun, 
letter, adjective, For example, the three 
stimuli might be “animal,” "small" and 
“M,” for which an appropriate response is 
“mouse.” The Prediction was that the sub- 
ject should be faster when the adjective is 
Presented before the letter, and this was the 
result found, The reasoning is as follows : 
When the adjective appears before the let- 


Curs, since the letter can be expected just as 
in the blocked condition, 


k back to the semantic net- 
Work where an intersection with the adjec- 
tive will occur, Then the Subject must re- 
turn again to the lexical network to retrieve 


the name, so there js an extra transit neces- 
Sary in this condition, 
Loftus has als 
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ing a category-letter pair (e.g. "fruit-P"), 
which asked the subject for an appropriate 
instance, then, following 0, 1, or 2 interven- 
ing items, showing the same category paired 
with a different letter (eg., “fruit-A”), 
which asked for a different instance. The 
general finding is that reaction time for the 
second instance is shorter than reaction time 
for the first instance and increases mono- 
tonically with the number of intervening 
items. For example, in Loftus (1973a) a 
subject’s baseline time to name a fruit be- 
ginning with the letter “P” was 1.52 sec. 
However, it took him 1.22 sec to produce the 
same response if he had named a different 
fruit on the previous trial and 1.29 sec to 
produce the response if he had named a dif- 
ferent fruit two trials back. 

The spreading-activation theory predicts 
these results by assuming that when an item 
is processed, other items are activated to 
the extent that they are closely related to 
that item. That is, retrieving one category 
member produces a spread of activation to 
other category members, facilitating their 
later retrieval. The assumption (Assump- 
tion 3) that activation decreases over time 
or trials predicts the lag effect. 

Meyer and Schvaneveldt (Meyer, 1973; 
Meyer & Schvaneveldt, 1971; Schvaneveldt 
& Meyer, 1973; Meyer, Schvaneveldt, & 
Ruddy, Note 3) have also shown that the 
time to retrieve information from memory 
is faster if related information has been ac- 
cessed a short time previously. Their para- 
digm is somewhat different. Subjects were 
required to classify letter strings as words 
or nonwords. The general finding was that 
the response time to classify a letter string 
d is faster if the subject has just 
semantically similar word as Op- 
mantically dissimilar word. 
me it takes to clas- 
s faster if “butter” 
it is preceded 
ye led Meyer 
ation in terms 
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Juola and Atkinson's Study with 
Multiple Categories 

An increase in reaction time with multiple 
categories has been found by Juola and At- 
kinson (1971) in a task where subjects had 
to decide whether a stimulus word belonged 
to one of a variable number (1-4) of pre- 
specified (target) categories. They com- 
pared this task with one where subjects de- 
cided if the stimulus word was the same as 
one of a variable number (1-4) of target 
words. Their experiment was designed to 
distinguish between two kinds of models, 
one they attribute to Landauer and Freed- 
man (1968) and one they attribute to Col- 
lins and Quillian (19702). In most respects, 
their results fit the model they derived from 
Landauer and Freedman, but since the 
spreading-activation theory provides an al- 
ternative explanation for their results, we 
want to compare their two models with our 
theory. 

The model Juola and Atkinson (1971) 
derived from Landauer and Freedman 
(1968) is very similar to what Landauer 
and Meyer (1972) call the “category-search 
model.” It assumes that the subject searches 
through instances of the categories in mem- 
ory seeking a match for the stimulus word. 
Such a model predicts that as the number 
of categories or words in the memory set 
increases, reaction time for the category- 
matching task should increase at a greater 
slope than reaction time for the word-match- 
ing task. This is because each additional 
target category adds more instances that 
must be searched, whereas each additional 
target word only adds one, the word itself. 
This result was essentially what Juola and 
Atkinson found. 

The model they ascribed to Collins and 
Quillian (1970a) assumed that subjects per- 
form the category-matching task by retriev- 
ing their stored category for the stimulus 
word and comparing this to the given cate- 
gories to see if it matches one of them. This 
model would predict that the slope for the 
two tasks should be about the same, and 
the intercept for the category-matching task 
should be greater than for the word-match- 
ing task. Their results clearly reject this 
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model. Although attributed to Collins and 
Quillian, this model is quite different from 
Quillian’s theory, because the semantic 
search in Quillian’s (1966) theory is as- 
sumed to spread in parallel from both cate- 
gories and instances. When the categories 
are given first, as in Juola and Atkinson's 
experiment, then activation would spread out 
from the categories before the instance even 
appeared. s . ] 
In order to explain our Interpretation of 
Juola and Atkinson's results, it is necessary 
to describe their procedure in more detail. 
They chose 10 large categories and 12 com- 
mon instances from each category as stimuli, 
This makes a total of 120 instances in all. 
In the word-matching task, they presented 
from 1 to 4 of the 120 instances as targets on 
each trial. In the category-matching task, 
they presented from 1 to 4 of the 10 cate- 
gories as targets on each trial. In both 
cases the negative stimuli were chosen from 
the same set of 120 instances, | In the word- 
matching task, then, the discrimination nec- 
essary to categorize the stimulus was be- 
tween one of the target words or à word 
that had not occurred as a target for a large 
number of trials (on the average about 24 
trials earlier), In the category-matching 
task, however, the discrimination was be- 
tween a word in one of the target categories 
and a word in one of the categories 
from a recent trial (on the average about 2 
trials earlier). The discrimination, there- 
fore, was rather easy in the word- 
matching task and quite difficult in the cate- 
gory-matching task, 
What we think must be ha 
task is that the discriminatio; 
tive and negative responses is 
partly) on the basis of activa 
the category-matching task, 
of categories increases, 
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but “body part" was a target on a previous 
trial. Then “tree” will have a higher activa- 
tion level than “body part,” but the differ- 
ence will not be very large because “body 
part” was presented so recently. Ifa posi- 
tive instance such as “oak” is presented, it 
will intersect with "tree" ; 


; if a negative in- 
stance such as “arm” is presented, it will 


intersect with "body part" The less activa- 
tion on "tree," which depends on how many 
other targets there are, the harder it is to 
discriminate that it is in fact a target or 


rt” is not a target. The more 


say, each nontarget wa 
SO many trials Previously 
24) that the activat 


à aces due to target- 
atively unimportant, 


9 aspects of 
at the data 
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There are two implications of this view 
that could be tested fairly easily. One is 
that reaction time to decide that an instance 
such as “arm” is a negative instance will 
depend on the recency with which “body 
part" was presented as a target category. 
The more recent its presentation, the longer 
it will take to say “no.” A more global im- 
plication is simply that the slope of the curve 
with respect to target size depends on those 
factors that affect the difficulty of discrimi- 
nation, such as the recency with which nega- 
tive instances occurred as targets (and prob- 
ably as nontargets as well). This has im- 
portant implications for the memory-search 
literature as a whole. 


Conrad's Study 


Using a true-false reaction time technique 
for sentences (e.g. the task is to decide 
whether "A salmon can eat" is true or 
false), Conrad (1972) found results which 
she interpreted as contradictory to Quillian's 
(1966, 1969) theory of semantic processing. 
Tn fact, the results of her study are quite 
close to what Quillian's theory would pre- 
dict given Conrad’s methodology. 

In her first experiment, which was like 
the Collins and Quillian (1969) study, Con- 
rad selected 2-level and 3-level hierarchies 
from the common culture (e.g. salmon => 
fish animal) and properties associated with 
the objects at different levels. Then she 
constructed sentences with instances, such as 
salmon,” from the lowest level and prop- 
erties from all three levels. The results 
Collins and Quillian found were that re- 
action time increased as the property was 
farther removed from the instance in the 
hierarchy. The reason for the increases in 
reaction time according to spreading-activa- 
tion theory is that as the instance and pue 
erty are farther apart in the hierarchy, 1 
takes activation longer to 3 


them an o trigger at inte 


1972) broke 


; into three groups n dct 
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the different objects in the hierarchies. An- 
other difference from Collins and Quillian’s 
study is that she collected data over 5 days 
by repeating all the sentences each day. 

The results of her first experiment were 
generally in the same direction as the in- 
creases in reaction time that Collins and 
Quillian found when the property was far- 
ther removed from the instance in the hier- 
archy. There was one reversal in her data 
out of nine comparisons, and this occurred 
for the high-frequency properties, where it 
was not unexpected given the weak theory of 
cognitive economy (as we will argue below). 
However, the increases she found were much 
smaller on the average than those of Collins 
and Quillian. The weak theory of cognitive 
economy predicts that people store a prop- 
erty with whatever instance it is linked to 
in a sentence, SO Conrad’s repetition of sen- 
tences over 5 days should lead to the smaller 
reaction time increases she found. This is 
because an inference necessary on the first 
day would be less likely on the second day, 
and so on. Conrad, in fact, reports a large 
Level x Day interaction. 

In general, Conrad found that the higher 
the frequency of the property. the smaller 
the increases betwen levels. Given the weak 
theory of cognitive economy, we would ex- 
pect that high-frequency properties are more 
likely to be stored at several levels in the 
hierarchy, because they are more likely to be 
encountered in contexts involving specific 
instances. For example, "leaves" are more 
likely to be stored as a property with par- 
ticular tvpes of trees (such as “maple” and 
*"oak") than is "park," because leaves are a 
higher frequency property. Thus, the effect 
of property frequency found by Conrad is 
consistent with the weak theory. 

Conrad (1972) argued that Collins and 
Quillian's (1969) results could be explained 
by a confounding of property level and prop- 
erty frequency. Her argument was that the 
sentences. Collins and Quillian used may 
have been based on high-frequency proper- 
ties for Level 1 sentences (eg. “A salmon 
is pink”), moderate-frequency properties for 
2 sentences (e.g. *A salmon has 


Level m 
and low-frequency properties Tor 


fins”), 
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Level 3 sentences (eg, “A salmon can 
eat"). To support her argument, she showed 
that if one plots her reaction time data in 
the above way, one obtains approximately 
the same slope as Collins and Quillian did, 
whereas if one plots the slope for low-, me- 
dium-, or high-frequency properties sepa- 
rately, one obtains much smaller slopes. j 
However, there are two weaknesses in 
Conrad's argument. First, she did not use 
her frequency data to evaluate Systematically 
the frequency of the properties in Collins 
and Quillian's sentences, so her conjecture 
about such a confounding had no empirical 
basis. Collins and Quillian (1969) did ob- 
tain subject ratings of importance of the 
property for the relevant level concept, 
which should correlate with Conrad’s 
measure. These ratings average 
ROG foe Level 1 sentences, 1.92 for Level 2 
sentences, and 2.16 for Level 3 sentences 
(based on a 5-point scale, where 1 = very 
important and 5= not important), These 
are small differences and certainly do not 
support the notion that the slope between 
Level 1 and Level 2 Sentences was due 
to the confounding Conrad hypothesized, 
The difference between Level 3 sentences 


atter group acted to 
cancel out any exaggeration of the 


Second, the ¢ (1972) 


made in plotting her data against Collins and 
Quillian’s data compared data p 


and Quillian’s data. But even that compari- 
son has the problem that she may well have 
included sentences of the kind, “A maple 
has leaves,” where the property is a general 
Property of trees, but where most people 
would store it as a Property of maples as 
well. This Suggests that the fairest com- 
Parison is between Collins and Quillian's 
data and her data on the first day for low- 
frequency Properties, where the properties 
were least likely to be stored at more than 
one level. But We cannot make this com- 
Parison because she did not break down her 
In conclusion, the differences 
between the two experiments and the fact 
that the only relevant dat: 
larly support th 
founding of Property leve] 
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theory than her first experiment is evidence 
for the theory. 


Effects of Typicality and Semantic Related- 


ness in Categorization Tasks 


In recent experiments, Rips et al. (1973), 
Rosch (1973), and Smith et al. (1974) have 
shown that reaction time in a categorization 
task corresponds very closely to ratings of 
how typical the instance is of the category. 
For example, robins and sparrows are con- 
sidered typical birds whereas chickens and 
geese are not. The effect of typicality on 
reaction time is quite large even when fre- 
quency of the particular instances in the lan- 
guage is controlled. Like Smith et al., we 
would argue that the typicality effect is one 
more manifestation of the fact that semantic 
similarity speeds up positive decisions and 
slows down negative decisions. Such an 
effect has been found repeatedly (Collins & 
Quillian, 1969, 1970a, 1972b; Schaeffer & 
Wallace, 1969, 1970; Wilkins, 1971). While 
Landauer and Meyer (1972) argued that 
the evidence for similarity effects at that 
time was either questionable or artifactual, 
the evidence now seems SO overwhelming 
that any viable theory must account for 
them. They are very damaging to the cate- 
gory-search model. 

There are two reasons why spreading- 
activation. theory predicts that atypical in- 
stances will take longer to categorize than 
typical instances. The most important rea- 
son derives from the way evidence is aggre- 
gated (see Assumptions 8-13). Because 
different connections that are found are com- 
bined as evidence, distinguishing properties 
can slow down à positive decision based on 
a superordinate connection or On Logo 
properties. For example, the prone i 
a chicken is à bird (ie, an atypica i 

ight be made on the basis of a 
tion from “chicken” to 
arn because chickens 


i , referred to as pirds. But the 
P pide eat chickens, that they are 
d d farms, and that they are rather 
e 1l properties that distinguisn 
me z m most bir If these distin- 
th operties are found during the 
guishing 
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memory search, as some are likely to be, 
they act to slow down the positive decision, 
because they are negative evidence. Simi- 
larly, matching properties can slow down a 
negative decision. For example, the deci- 
sion that a goose is not à duck might be 
made on the basis of the difference in their 
necks (a distinguishing property) or simply 
because they are stored as mutually exclu- 
sive kinds of birds (see Assumption 12), 
but the matching properties that are found 
(e.g. their affinity to ponds, their webbed 
feet, their large size) will slow down the 
decision that they are different. The argu- 
ment here is similar to that of Smith et al. 
(1974), which we will discuss in comparing 
the two theories. 

The second reason for the typicality ef- 
fects relates to those cases where a super- 
ordinate connection is found. As we indi- 
cated earlier, superordinate links differ in 
accessibility (or strength), and accessibility 
depends on use. Tf a person frequently uses 
the link that a robin is a bird, and less fre- 
quently uses the link that a chicken is a bird 
(assuming approximately equal frequency 
for chickens and robins), then the accessi- 
bility of "bird" from “robin” will be greater 
than from “chicken.” Because of this, ac- 
cessibility will be highly correlated with 
typicality ratings. All the factors acting to 
make a chicken or a goose an atypical bird 
in the real world will also act to make the 
use of the superordinate link in a person's 
mind from "chicken" or “goose” to “bird” 
infrequent. It is because they are atypical 
that the superordinate link is weak, and this 
will also act to slow down reaction time 
in making categorization judgments about 
atypical instances. 

The way evidence is aggregated in the 
theory also explains the common finding 
(Collins & Quillian, 1970a, 1972b; Holyoak 
& Glass, 1975; Rips et al., 1974) that people 
are fast to decide that semantically unrelated 
concepts are different (e.g., that a book is 
not a dog). In comparing such concepts, 
there are not likely to be any superordinate 
connections, and almost all property con- 
Ive distinguishing rather 


nections will invo 
than matching properties. Therefore, almost 
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all the connections found will constitute neg- 
ative evidence, and subjects will be quite 
fast to reach the negative criterion in such 
cases. This too is similar to the explana- 
tion in the Smith et al (1974) model, 
Recently, Holyoak and Glass (1975) have 
isolated two different cases where semantic 
relatedness or typicality does not Produce 
the usual effect on reaction time for nega- 
tive judgments. One case arises when the 
decision depends on what they call a contra- 
diction and what we have called mutually 


exclusive subordinates, The other case 
arises when the decision depends on a 
counterexample. 


In the first case, Holyoak and Glass found 
that people are faster to Teject sentences 
such as “All fruits are vegetables” or “Some 
chairs are tables” than sentences such as 
“All fruits are flowers” or “Some chairs are 
beds." In these four Sentences the two 


"vegetables" and “tables” are generated with 
high frequency, while “flowers” and "beds" 


the two Concepts a 
semantically, 


properties 


how long it Will take to fir 
tion between the 
concepts, 


The second finding of Holyoak and Glass 
(1975) involves sentences where People re- 
ject the sentence by finding a counterexam- 
ple (Assumption 13). For example, “All 
animals are birds,” can be rejec 
ing another kind of anima ; 
mal. 
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producing a counterexample. 
Here again Where a decision Strategy that is 


not based on distinguishing Properties is 
appropriate, the reaction time data do not 
depend on the semantic relatedness of the 
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theory and point out what we think are its 
inherent difficulties. 

In the model of Smith et al. (1974), the 
meaning of a concept is assumed to be rep- 
resented by semantic features of two kinds: 
defining features and characteristic features. 
Defining features are those that an instance 
must have to be a member of the concept, 
and the model assumes that features can be 
more or less defining. Characteristic fea- 
tures are those that are commonly associated 
with the concept, but are not necessary for 
concept membership. For example, “wings” 
might be a defining feature of "birds" and 
"flying" a characteristic feature, since all 
birds have wings but not all fly. In a cat- 
egorization task, the model assumes that the 
two concepts are first compared in Stage 1 
with respect to all their features, both char- 
acteristic and defining. If the match is above 
a positive criterion, the subject answers 
“yes”; if it is below a negative criterion, the 
subject answers "no"; and if it is in-be- 
tween, the subject makes a second compar- 
ison in Stage 2 based on just the defining 
features. If the instance has all the defining 
features of the category, the subject says 
“yes” and otherwise says “no.” If the sub- 
ject can decide in Stage 1, his reaction time 
will be faster than if he decides in Stage 2. 

There are several minor differences be- 
tween the model of Smith et al. (1974) and 
the spreading-activation theory that could be 
minimized by slightly changing their model. 
The difference in wording between compar- 
ing features in their model and finding links 
between properties in our theory is really 
a nondifference. But the distinction between 
defining and characteristic features has the 
inherent difficulty, pointed out through the 
ages, that there is no feature that is abso- 
lutely necessary for any category. For ex- 
ample, if one removes the wings from a bird, 
it does not stop being a bird. Furthermore, 
we doubt if people can make consistent de- 


ae DU; 

4 There is for living things a biologists’ taxonomy, 
which categorizes objects using properties that are 
not always those most apparent to the. layman. 
Thus, there are arbitrary, technical definitions that 
are different from the layman’s ill-defined concepts, 
but this is not true in most domains. There is no 
technical definition of a game, a vehicle, or a 
country that is generally accepted, 
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cisions as to whether a feature is defining 
or characteristic, either from time to time or 
from one person to another. Smith et al. 
recognized that features are more or less 
defining (or criterial), but they were forced 
into making the artificial distinction between 
defining and characteristic in order to have 
a two-stage model. Still, the model could 
be revised to work without the two stages 
and make essentially the same reaction time 
predictions. 

The revision is as follows: If features are 
compared over time, as in Quillian’s (1966) 
theory, then as the process goes on longer, 
more features will be compared (assuming 
features have different accessibilities). The 
comparison process can have a positive cri- 
terion and a negative criterion just as before, 
and features can be weighted by their cri- 
teriality. If the match at any point in time 
is above the positive criterion, the subject 
says "yes"; if the match falls below the 
negative criterion, the subject says “no”; 
and otherwise he goes on comparing fea- 
tures. Finally, if he is running out of rel- 
evant information, he says "I don't know." 
This is simply the Bayesian decision model 
described in Assumption 9 of the extended 
theory, where the evidence consists of 
matching and mismatching features as in the 
property comparison of Assumption 11. 

Thus, we agree that a decision process 
similar to the one that Smith et al. (1974) 
postulate does occur for some categorization 
decisions. But there is a fundamental dis- 
agreement, because they argue that all cat- 
egorizations judgments are made by compar- 
ing features of the instance and category, 
whereas we argue that people use whatever 
evidence they find, including superordinate 
links. 

Because they exclude the use of superordi- 
nate links, the model of Smith et al. has sev- 
eral inherent difficulties. The most obvious 
is the assumption that even when people 
have superordinate information stored, they 
do not use it. While most people may not 
have learned some superordinate relations 
(e.g., that a beaver is a mammal, or a sled 
is a vehicle), there are many they have 
learned (e.g. that a wren is a bird, and a 
beaver is an animal). Why would they not 
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use such information if it is stored? How 
in fact can they avoid using it? It is an 
unlikely model which postulates that people 
use information that is less relevant to make 
a decision, instead of information that is 
more relevant. 
Another obvious 
et al. (1974) mog 


But people gen- 
no idea what the defining prop- 
erties of à mammal, an animal, a bird, or a 
sparrow are, Even if they know that one 
of the most criterial properties for being a 
mammal is that it bears its young alive, 
it seems highly unlikely that they know 
Whether whales (or beavers for that matter) 
bear their young alive. Neither of the 
authors has any idea what properties of a 
Sponge make it an animal, but if asked in 
an experiment whether 4 Sponge was an an- 
imal, we would answer “yes,” and we would 
be comparatively slow about it. The reason 
we would answer “yes” is simply that we 
were told at one time that a Sponge is an 
animal. We were also told that a bat is not 
a bird, and if we had not been told, we fear 
we might have responded “yes c asked 


whether a bat is a bird in a categorization 
experiment, ta wren is not 


atures of à 


Sparrow are that a wren does not have, 


fact that there 
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information 
decision 
properties, of the strongest 
arguments for a hybrid theory, 

We would like to Close this Section p, 
raising the question of why one should adopt 
such a complicated theory when the Smith 
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et al. (1974) model is simpler and predicts 
the reaction time data quite well. We lave 
tried to stress the inherent difficulties that 
their model has in ignoring superordinate 
information and in relying on defining Prop- 
erties. Experimental tests can pu 
devised that will show up those difficulties. 
We will suggest one such test, but EUM 
might point out that the results of the Lo = 
(1973b) categorization experiment describe 

earlier do not fit the Smith et al, model very 
well. If a person is merely comparing fea- 
tures between the instance and the category, 
then it should not matter whether the in- 
stance or category i; It is 


€ superordinate connec- 
the asymmetry Loftus 
found in reaction time, and it is hard to 
imagine how one could have an asymmetry 
of that kind in comparing features of two 
concepts, 

One experiment th 
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such as the Smith et al. model, makes the 
opposite prediction. So this is a possible 
test of the two theories. There are un- 
doubtedly many other tests. 

Finally, we want to explain why we have 
been led to such a complicated theory. In 
trying to write computer programs that an- 
swer different types of questions, it becomes 
apparent that any decision procedure that 
gives correct answers must be flexible 
enough to deal with many different config- 
urations of knowledge in memory. This is 
because people have incomplete knowledge 
about the world (see Collins et al., 1975), 
and they often do not have stored particular 
superordinate links or criterial properties. 
Any realistic data base for a computer sys- 
tem will have this same kind of incomplete 
knowledge. "Therefore, perhaps our strong- 
est criticism of the Smith et al. (1974) 
model is that it breaks down when people 
lack knowledge about defining features. 

While at one level this is a complicated 
theory, at another level it is a simpler theory 
than the Smith et al. model. By viewing 
superordinate links as highly criterial prop- 
erties, the theory becomes a simple Bayesian 
model. It is only in specifying the particular 
configurations of knowledge that constitute 
positive or negative evidence for the Bayes- 
ian process that the theory becomes compli- 
cated. The difference between the two the- 
ories is that the Smith et al. model allows 
only one kind of evidence (matching or mis- 
matching features), whereas the theory pre- 
sented here allows other kinds of evidence 
as well. Thus the theory encompasses a 
revised version of the Smith et al. model as 
a special case of a more general procedure. 


CONCLUSION 


We have extended Quillian’s spreading- 
activation theory of semantic processing in 
order to deal with a number of experiments 
that have been performed on semantic mem- 
ory in recent years. The result is a fairly 
complicated theory with enough generality 
to apply to results from many different ex- 
perimental paradigms. The theory can also 
be considered as a prescription for building 
human semantic processing in a computer, 
though at that level many details are omitted 


about decision strategies for different judg- 
ments that arise in language processing (see 
Carbonell & Collins, 1973; Collins et al., 
1975; Quillian, 1969). We would argue 
that the adequacy of a psychological theory 
should no longer be measured solely by its 
ability to predict experimental data. It is 
also important that a theory be sufficiently 
powerful to produce the behavior that it 
purports to explain. 
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Subjective Duration and Psychophysics 


Hannes Eisler 
University of Stockholm, Stockholm, Sweden 


Three models concerned with the behavior of subjects estimating two suc- 


cessive durations are proposed. The a 


ccepted model assumes that the sub- 


jective total duration (sum oí first and second durations) and the second 
duration are each accumulated in a separate sensory register. In a duration- 
matching experiment, for instance, the difference between the contents of 


the two registers is matched to the content of the second register. 
(a) it does not include any memory, thereby 


model is accepted because 


This 


eliminating certain difficulties connected with coding and storing of duration, 
(b) it copes with characteristic features of duration discrimination, and 
(c) it can satisfactorily explain data obtained in four scaling experiments, 
namely, magnitude estimation, matching, halving, and doubling of 10 dura- 


tions between 1.3 and 20 sec. 


As a by-product, Stevens’ 


power law is 


uniquely derived, and exponents are computed from matching data, thus 


eliminating the subjects’ numerical behavior. 


the time-order error for time. 


The present investigation can be seen as 
lying at the intersection of two problem 
complexes: the validity of psychophysical 
theory for direct scaling and the psycho- 

tes of time perception. The connection 

tha  “{uitous—as will be shown, it is 

git ly the peculiarities of the time sense 
tnat make it possible to validate direct psy- 
chophysical scaling methods, to demonstrate 
that Stevens’ power law is the only possible 
psychophysical function, and to determine 
the exponent for duration without involving 
the subjects’ numerical behavior. 

Three models are proposed to describe 
the strategy applied by a subject (or, per- 
haps better, the process in a subject) when 
he is confronted with two successive time 
intervals and is required to deal with some 
relation between them, for example, by tell- 
ing which was the longer or by adjusting the 
second to match the first. The models are 
independent of the choice of any particular 
subjective scale (psychophysical function), 
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The model also accounts for 


but a number of nonquantitative arguments 
seem to favor one of them. In order to 
compare the models quantitatively as well as 
connect them with scaling theory, a series of 
experiments was carried out, namely, magni- 
tude estimation (ME), equal setting (ES), 
halving (HS), and doubling (DS) of 10 
durations varying between 1.3 and 20 sec. 
The typical outcome of ES for one subject is 
taken as a starting point to derive Stevens’ 
power law and other psychophysical rela- 
tions. Based on this theoretical develop- 
ment, the experimental results are dealt with 
and the three models are reexamined in the 
light of these findings. 

A closer analysis of a subject’s task when 
dealing with two successive durations brings 
out two problems specifically pertinent to the 
perception of time: 

1. The experimenter is not really in con- 
trol of the stimulus: Time continues to 
elapse after the end of the first duration, 
during the interstimulus interval (ISI), and 
during the second stimulus duration. We 
might thus ask whether a subject is capable 
of ignoring the steady flow of time and fo- 
cusing on the limited interval presented 
while waiting until the response can be 
given. 

2. According to the traditional view, the 
subject has to perceive the first duration 
and retain "it" in memory until the offset 
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Iodel In subjective terms 
M 
- «(bs — Pos)? (dy. — doy.) 
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y an argument from 
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(1 and 2) in different Ways or fail to treat Model c 
an 
them at all. 


assumes agreement between ex- 
Perimenter and j 
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able, but t 
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Fıcure 1, Block diagrams for the three models. PG = pulse-controlled 
gate (closed at start of cycle) ; LTM = long-term memory; @ = compara- 
tor; m =a pulse. 
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Short 
pause N 


The content of a long-term memory is coded The properties of a long-term memory and 
more or less efficiently. i (c) The content of a sensory register, as listed above, are only 
a sensory register typically changes con- sketched and call for qualifications, which, 
tinuously, though not necessarily throughout however, are not important in the present 
the period of its use. The content of any context. Unlike Point (c), Points (a) and 
memory is fixed. (b) indicate features that a sensory register 
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might share with a short-term or an iconic 
memory. i AP 

The Dus are drawn with the experi- 
ments mentioned 
mind. A "cycle" is à Sequence of two dura- 
tions, standard and variable, Separated by a 
; The durations are 
The subject 
turns off the variable by pressing a button 
when it is experienced as equal to the stan- 


According to Model A, the Sensory regis- 
ter is cleared at the start of the Cycle. It 


by a pulse, 
Storing in the LTM of the content of the 
Sensory register. When this has been ef- 
fected, a new pulse resets the Sensory regis. 
ter and opens the two pulse-controlled gates 
to the comparator. There the content of 
the LTM is continuously compared with the 
content of the Sensory register. We see that 
if the time required for coding and storing 
exceeds the ISI, the first part of the variable 
duration will be lost, making it impossible 
for the subject to fulfill his task in the in- 
tended manner, 

This is not the case in Mode] B, which 
differs from Model A in that the Sensory 
register is not reset after the transfer of its 
content to the LTM. In the comparator the 
content of the LTM is continuously com. 
pared with the computer output, Which js 
the difference between the total and the stan- 
dard duration, 
his task as intended except for cases where 
both the ISI and the variable are extremely 
short. Then he misses the right moment to 

5 e button, 

d eds both Model A and Mode] p 
make use of the LTM, this is replaced by a 
second sensory register in Model C, Both 
Sensory registers are cleared at the start of 
the cycle, and the first one accumulates the 
total time. The short pause pulse Starts the 
second register, where the variable duration 
then accumulates, and Opens the two gates, 
In the comparator the variable duration is 
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continuously compared with the computer 
output, which is the difference between the 
total and the variable duration, 


The following analogy may make the pro- 
cesses more understandable, 


no scale or mark except 
hands moye linearly, 


Watches and 2 are 
Stop-watches ; Watch 4 oes not have any 
works, but the hand is adjustab] In all 
cases two of the three wa 
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at the o Set of 47^ Standar and Watch > 
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f Wate Rs Zer Point and the hand 
hang; = S th tance | tween the 
the bug Oving), he subj Presses 
It is l 
t ily reali 
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Finally, let us note the following. If 
the movements of the hands are uniform, the 
outcome (moment of button press) is the 
same for all three models. This would cor- 
respond to a veridical perception of time, 
-and the three models would be indistinguish- 
able. Other than proportional psychophysi- 
cal functions correspond to accelerated or re- 
tarded movements of the hands in the anal- 
ogy, entailing different outcomes for the 
three models. 


DURATION DISCRIMINATION AND THE 
THREE MopELs 


Allan and Kristofferson (1974) list three 
findings that characterize duration discrimi- 
nation : 

1. A subject’s ability to discriminate a 
constant difference in duration is indepen- 
dent of the duration values. 

2. Discriminability between two durations 
is independent of the ISI. 

3. Forced-choice performance is not su- 
perior to single-stimulus performance, 

The first two statements need some quali- 
fication: The first statement seems to be 
valid only for practiced observers, and the 
second holds only for ISIs exceeding 500 
msec; for shorter ISIs discriminability de- 
teriorates. 

We will examine these findings within the 
frame of reference of our three models. For 
the case of discrimination between two suc- 
cessive durations the subject’s response de- 
pends on the output from the comparator at 
the offset of the second duration : +, —, or 0. 

In continua other than time (as well as for 
unpracticed observers in duration discrimi- 
nation), discriminability deteriorates with 
increasing stimulus magnitude for a given 
difference, particularly when the stimuli are 
presented in succession. This may be at- 
tributed to the increase in variability of sub- 
jective magnitude with magnitude, particu- 
larly for the memory of the first stimulus. 
Time perception according to Models A and 
B should behave in the same way. Model 
C, however, can explain why the length of 
the duration does not affect discriminability 
(Allan and Kristofferson’s first finding). 
First, no memory is involved. Second, if 
we assume (a) that there is a correlation of 
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unity between the errors in the outputs from 
the two sensory registers—reasonable be- 
cause of the simultaneity—and (b) that the 
errors are roughly proportional to the du- 
rations,! they become the same for both in- 
puts to the comparator and thus cancel: 
Vr + 2e — (iy + e) yields an error of e, that 
is, the same as in yp. (Perhaps it may be 
concluded that a naive observer behaves in 
accordance with Models A or B and only a 
practiced one with Model C.) 

'The influence of the ISI found for con- 
tinua other than time may be attributed to 
effects on the memory that progress with 
the length of the ISI and possibly to dis- 
turbances in the perceiving of the second 
stimulus while the first is being coded and 
stored. Since Model C does not require any 
memory, it eliminates this factor and thus 
conforms to Allan and Kristofferson's second 
finding. One would have to assume, how- 
ever, that Sensory Register 1 is switched off 
during the ISI. The disruption of perform- 
ance for ISIs < 500 msec might be due to 
the time required for switching Sensory 
Register 1 on and off. 

Allan and Kristofferson's third finding is 
predicted by Model C under the assumption 
that the subject does not follow the single- 
stimulus instructions as intended by the ex- 
perimenter. Even singly presented dura- 
tions are given in a sequence, and rather 
than storing the standard in the LTM and 
comparing it with the current stimulus, the 
subject compares the current stimulus with 
the preceding one in the way suggested by 
Model C. If the output from the compara- 
tor is +, his answer is “short,” if it is —, 
“long.” After each response the register 
holding the total duration is cleared, and the 
tasks of the two registers are switched with 
each new stimulus, so that the new duration 
accumulates in the presently cleared regis- 
ter, whereas the other register continues to 
accumulate the new duration on top of the 
previous one. This implies that the subject 
is carrying out a forced-choice task even 
when instructed according to single stimulus, 
and that a single-stimulus experiment proper 
simply cannot be carried out with durations. 

1 Ekman’s law (Ekman, 1956, 1959). 
Eisler (1962) and Teghtsoonian (1974). 


See also 
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The advantage of the forced-choice method 
in always presenting two different durations 
in a trial is offset by the subjects oppor- 
tunity of repeating his latest answer when 
the output from the comparator is 0. How- 
ever, though the overall results of single 
stimulus and forced choice should be ap- 
proximately the same according to Model C, 
a difference should be detectable in the mi- 
crostructure of the data: In single stimulus 
both a correct and an incorrect judgment 
should show a tendency to perpetuate itself 
as long as the next stimulus equals the pre- 
ceding one. All of these considerations 
favor Model C. 


Ratio SCALING OF SUBJECTIVE 
DURATION : THEORY 


Most of what is said in this section is 
pertinent also for continua other than time. 
Stevens’ power law will be derived from 
precisely specified assumptions. Further- 
more, it will be shown that combining Model 
C with the power law has two interesting 
consequences: (a) possible alternative psy- 
chophysical functions (cf. Treisman, 1964) 
have to be rejected, and (b) the exponent 
can be estimated from equal-setting data, 
that is, without involving the subject’s nu- 
merical behavior. 

The theoretical development is under- 
taken with the results of the ES experiment 
for Subject 1 as the starting point. In Fig- 
ure 2A these data are given as a plot, where 
the variable duration is plotted against the 
total duration, in accordance with Model C. 
This particular plot has been chosen for 
demonstration because it gives the clearest 
picture of the main features of all 36 plots 
(3 experiments, 12 subjects) with an almost 
even distribution of the 10 data points 
among three straight lines. (As to fit, it 
ranks only 27th among the 36 sets of data.) 
The other plots are similar, though a number 
of them yield only two straight lines, and a 
few only one. (Four plots show slight curvi- 
linearity.) Two features are apparent in 
Figure 2A: The plot is segmented, in that 
it shows discontinuities, and the segments 


are linear. 


The Psychophysical Function 


The linearity observed does not depend on 
the choice of model. It is easily shown that 
a plot of variable against standard duration 
(Model A) or of standard duration against 
total duration (Model B) likewise is linear. 

The derivation given below is based on 
two definitions, two assumptions, and one 
empirical finding. It will be shown later 
that one of the assumptions follows from 
Model C; the other assumption is confirmed 
by the experimental results. 

The empirical linearity of the data can be 
described by? 


dy — abb (1) 


(empirical finding), where © denotes the 
physical value of the standard duration (in 
general, here of the total duration), and dy 
the variable duration, Furthermore, we have ` 


y —f(9) (2) 
(first definition), and 
yy = g(9v) (3) 


(second definition), where y denotes sub- 
jective values and f and g the psychophysical 
functions for standard and variable dura- 
tions, respectively. If the subject carried 
out an r setting (r=1, .5, and 2 for ES, 
HS, and DS, respectively), we have 


Wary (4) 
(first assumption). Equations 1-4 yield 
rf(®) = g(a? + b). (5) 


The functional Equation 5 is too general 
to allow a solution. We can pick f at will 
and can always find the corresponding g. 
However, a simple restriction makes Equa- 
tion 5 uniquely solvable. : 

The second assumption is that the two 
psychophysical functions, f and g, are inde- 
pendent of the particular task, that is, inde- 
pendent of r. In other words, the psycho- 
physical functions should be the same, re- 
gardless of what ratios are to be set by the 
subjects. This requirement is an obvious 
prerequisite for any psychophysical theory of 
some generality. This invariance assump- 


2A more detailed derivation of the psychophysi- 
cal function will be found in Eisler (1974). 
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tion entails that a variation in r is reflected 
only by a variation in a and possibly b. p 

Taking the derivative of Equation 5 with 
respect to y yields 


f(9) = (ab + b')-g' (ad + b), (6) 
and with respect to ® yields 
rf (9) — a-g'(a$ + D). (7) 


Dividing Equation 7 by Equation 6 yields 


TUR. x 
FO TET = 


and integrating Equation 8 with respect to ® 
yields 


r-Inf(®) = S Inja TI Cir), 
or 


F) = Cr) (a^ + panes. (9) 
Remembering that f(®) is ind 
(9) is rewritten in the follo 

Y = f(8) = a(b — $,)5, 


where «œ constitute 
the subjective zero, 
the psychophysical functi 


pendent of y, 
wing way: 


$4, (10) 


Yr = a! (by — $8, 
` Note that without any a 


nter. 
est), and that the PSychophysica] function 
derived in this Manner is Stevens Power 
law.? 


It has already been shown (Ekman, 1958; 
Eisler, Note 2) that à Power function i; 
compatible with a linear p-p relation; what 
is shown here is that ap 


ower function con- 
stitutes a unique solution to Equation 5, 


3 Note that this derivation refutes 
tion of the power law as Y — od! 4- y, 
cussion in Galanter & Messick, 1961), 


the formula- 
(cf. the dis- 
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Discontinuous Psychophysical Functions 


Consider Figure 2B, where subjective du- 
ration is plotted against physical duration. 
We see that the psychophysical function 
consists of two segments, that it exhibits a 
discontinuity, or break. Though breaks in 
psychophysical functions, particularly for the 
continuum of duration, have been found by 
quite a number of researchers (Eisler & Ek- 
man, 1959; Reese, 1943; Ross & Katchmar, 
1951; Taves, 
haeuser, N 
neglected 


This parti- 


aS. ee groups is re- 
flected in Figure 2A, where d, i plotted 


against ®: the lowest and the uppermost 
Straight lines describe the regions where the 


O the transition re- 
the standards lie on the upper 
'ariables on th 
break in 
re 2B) re 


A. z ote that the break of the data in Fig- 
€ 2 is at Sec, ent = 
eal sky tailing à break at 4 sec 


ST at 4 sec on the 
ote further that the 
tror” when he makes 
Or different stimuli 
S isa Simple result of 
sMents that differ in at 


the Psychophysical 


axis ang anoth 
ON 


Psychophysical 
following way. 
and 11 into Equa- 
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tion 4 we obtain 
ra(® — dx)? = a’ (by — P)’. 
Rearranging yields 


ra\8 ra NM 2 
Py = P "p — (s "Po + P'o. (12) 


Equating the coefficients of Equations 1 and 
12 we obtain (see Ekman, 1958; Eisler, 


Note 2) 
5)" 
zd = a, 
a 
that is, 
B = log(re/a') /log a, (13) 
and 


1/8 
= (£) 4A b. (14) 


Since the slope of the outer lines depends 
only on r and 8 (a =a), they must be 
parallel. For the case shown in Figure 2, 
moreover, their intercepts are the same, im- 
plying also the same subjective zero. This 
happens often but not always. The slope 
of the middle line depends on r, 8, and on 
the ratio «/a’ of the scale units. 

Denoting the slope of the outer lines by a, 
we obtain 


B = log r/log a. (15) 


Denoting the slope of the middle line by a', 
the ratio of the scale units is obtained : 


ofa! = (a'Ja)8. (16) 


The intercepts and slope of the outer lines 
yield the subjective zero(s) : 


o — b/(1— a), (17) 


where b is the intercept of the lowest line 
when it is the subjective zero of the lower 
segment of the psychophysical function that 
is sought, and of the uppermost line when 
it is the upper segment. Equation 14 pro- 
vides a check. Denoting the intercept of the 
middle line by b’, the subjective zero of the 
upper segment by ®o and of the lower seg- 
ment by P'o, we have 


—a'd, + y = V. (18) 


When the data show only two straight 
lines or the third line consists of only one 
or two points, Equation 18 may be used to 
compute d, or d'y as the case may be. 


Ratio Scales of Duration and the 
Three Models 


What has been said up to now about 
scales is valid whichever model one chooses. 
Table 1 describes the three models also in 
terms of the psychophysical function. Be- 
cause of the arbitrariness of the scale unit, 
it has been set to unity. If there are two 
segments with different scale units, the scale 
unit of the lower segment is set to 1, and 
that of the higher segment to a. Thus « may 
or may not equal unity in Table 1, 

A remarkable feature of Models B and C 
is that the exponent for subjective duration 
can be computed from equal setting, that is, 
without any use of numbers in the experi- 
ment. For Model C, the constant r in Equa- 
tion 5 becomes 1/2 (the variable subjective 
duration becomes half the total subjective 
duration). This constant of 1/2 belongs to 
the scientific meta language (Bergmann & 
Spence, 1944) from an epistemological 
point of view, since it emerged from the 
model, and not to the object language, as, 
for example, the constant .5 in a common 
half-setting experiment. 

This fact proves that there is no alterna- 
tive to the power function for the experi- 
ments described here by confirming the first 
assumption made in the derivation of Stev- 
ens’ power law. What amounts to halving 
in the meta language is equal setting in the 
object language. 

To recapitulate, the premises leading to 
the power function are as follows: 

1. The ratio settings are linearly related 
empirically to the standards. 

2. Model C (or B) is accepted, excluding 
possible other formulations of the subjects’ 
behavior such as Equation 5, at least for 
equal setting. 

3. The form of the psychophysical func- 
tion is assumed to be independent of the 
prescribed ratio, 

The first premise yields Equation 5, the 
uniqueness of which is guaranteed by the 
second premise, and the third premise sup- 
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EXPERIMENTS 

Experimental Design 
Using the same 10 time intervals as stimuli or 
standards, five experiments were carried out: re- 
action time (RT), magnitude estimation (ME) 
equal setting (ES), half 


setting (HS), and dou. 
tting (DS). The intervals (durations) were 
te ih as 30 ks 60, 8.1, 11.0, 148, and 206 


sec. Thus they ranged from 13 to 20.0 sec and 
were equally spaced in logarithmic units, They 
were presented acoustically, as a Strongly dis. 
torted tone from a small loudspeaker, 

Each experiment consisted of a n 


in immediate succession. 


(a) a long 
pause (silent interval), followed by (b) one of 
the standard durations, followed by (c) a short 
pause (silent interval), that is, 


$ 
Start of 
variable 


escribed in t 


ama è Physical time — 
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ng to Model C, 
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Button press: 
end of cycle 


Long pause 


Stimulus duration 


Response 
duration 


Short pause 


—— M i 
Time 


Ficure 4. A cycle. 


ject's part consisted of a metal box with a small 
red lamp and the loudspeaker. Attached to this 
box by a cable was a grip with a pushbutton that 
activated a microswitch when pressed. This part 
of the equipment was monitored by the experi- 
menter's part, which controlled the cycles, includ- 
ing the presentation of the standard durations and 
the registration of the response. The two pauses 
could be adjusted by two knobs and remained the 
same during any one experiment. The standard 
durations (30, in pseudorandom sequence of 10 
different durations, see above) were given by 
means of a punched mylar tape in a closed loop. 
A cycle started with the reading of the next stan- 
dard time and was terminated by the button press, 
which also initiated the next cycle. Standard time 
and variable time (response) were punched on 
paper tape (resolution: 10 msec) for later treat- 
ment with a high-speed computer. An experiment 
was stopped automatically after a predetermined 
number of cycles. The red lamp on the subject's 
box was switched off for the duration of the ex- 
iment, thus indicating to the subject the be- 


per: ; 
: ning and end of the experiment. 


gin 


Procedure and Subjects 
f psychology 
; rgradüate students o! c 
4 ge Re as part of their course require- 
ments All subjects took jar in all a etie 
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time. t " 
Ve end of an exprimi waon pis be Te 
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ie Dile nee ot but six subjects, for 
vx age oe iments was 
2 e a of the last two experime 
reversed, so that DS precede d "TI 


In RT the subjects were P i 
* Tis ambe | ficient to obtain 
closer €x- 


that the 


iu 


Broup data of acceptable 
: er, 
mination of the data revealed, however, 


tibiects behaved vastly differently, 


Feat the data individually. 


For explanation see text. 


cles, in each of the other four experiments with 
70 cycles. Since the number of cycles for each 
experiment was not a multiple of 30 (the number 
of cycles in a block), each experiment started, 
both intra- and interindividually, at a different po- 
sition in the block. 

The main purpose of the reaction-time experi- 
ment was to familiarize the subjects with the stim- 
ulus durations used, and it is therefore not con- 
sidered further. The reaction times varied be- 
tween 200 and 600 msec, most values being not far 
from 300 msec. 

The magnitude-estimation experiment was car- 
ried out with a free standard. The instruction 
stressed that the subjects were to rely on their 
spontaneous experience. They should not "think 
in terms of seconds," nor should they count. Each 
estimate was to be entered in a mimeographed 
table, after which the subject was to press the 
button to obtain the next stimulus duration. 

In the ratio-setting experiments the subjects 
were told that they had to listen to two durations, 
separated by a short pause. Pressing the button 
terminated the second duration, which was to be 
of the same length as (half the length of, twice 
the length of) the first one. Again, it was stressed 
that the task should be carried out in terms of 
spontaneous experience. 


Results 

In all four experiments the first 10 cycles 
were discarded as preliminary trials. Table 
2 gives the means of the remaining 60 cycles 
(see exceptions below) for all 12 subjects 
individually. 

Magnitude estimation. Every value in 
Table 2 is the arithmetic mean of six esti- 
mates, The standard deviations varied be- 
tween 0 and 60% of the mean estimates, 
with 30% as a rough average. — 

Ratio settings. Of the 2160 adjusted du- 


rations, 44 were discarded as outliers, pro- 
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Dy = 0 for 


TABLE 2 
n IBJECTS Invivipvatty, WITH ARITHMETIC MEANS of M 
NSES FOR THE 12 SUBJEC i y 
poris AND ARITHMETIC MEANS OF EQuaL, H 


AGNITUDE Estim 
ALF, AND Dounrg SETTINGS 


IN SECONDS 


ERR a 
Standard duration (seconds) 


ATIONS 


EE 33 45 90 go no as 20.0 
Subject 1.3 1.8 
j Se Dii a 
Magnitude estimations 
2 2.00 3.33 6.33 6.67 8.67 8.33 12.33 12.33 18.00 
i 300 4n jo 70 AU AE. io Ge DU 
1.50 1.67 2.0; 2.8: 3. 3.67 4.50 5.00 i 
A 3.00 333 7.33 8.33 E 00 1500 20.50 26.67 4 17 Pe 5 
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6 433 — 633 E: a3 zs 7 ri 40 467 e$ 85.83 96 67 
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1 250 292 i» m 48. ne 15.00 19°33 26.67 3607 
» - T 14.83 1839 2 8 73 67 100 13.83 
10 5.83 eee 25 417 5.00 E 38.33 31.67 66.67 — 9i 17 
s 1.67 X tsi. got ai; 949 p 00 — 16.50 21.00 
m 5.00 — 8: - 3333 — 4000 53.33 63'33 
qual Settings (seconds) = m 
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2 93 120 41, To qas E {De 435 725 1S 13/52 
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5 5» iu 15 wh X5 ARE EE C 833 — 19s 
6 1.02 145 286 39 A — E» $26 32 438 14.05 
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46 131 0: 239 SO 6.9 nee 6.57 7.65 
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TABLE 2— (Continued) 


Standard duration (seconds) 


Subject 1.3 1.8 2.5 3.3 45 6.0 8.1 11.0 14.8 20.0 
Half settings (seconds) 
1 A6 252) 71 1.12 141 1.99 2:20 3.08 3.85 5.41 
2 OF .69 1.16 1.52 2.16 3.35 3.84 4.59 7.02 9.41 
3 17 .62 1.05 1.37 2.03 2.75 3.77 6.43 6.38 12.90 
4 i31 AT 17 -96 1.44 1.94 2.38 3.10 4.87 6.70 
5 Ta 4 1.01 1.13 1.44 2.59 2.66 3.25 3.89 6.09 
6 .60 .86 1.65 2.42 2.74 4.45 4.89 6.56 8.43 10.01 
7 .70 1.29 2.38 3.07 3.06 4.74 4.66 5.39 6.98 7.31 
8 .50 wt 86 1.07 1.93 2.30 3.52 4.25 7.45 9.35 
9 8 05 1.11 1.15 1.99 2.30 2.96 3.73 4.12 4.06 
10 42 74 1.08 1.87 2.64 3.76 5.44 7.60 7.08 8.98 
11 56 58 .69 -76 1.20 1.89 2.41 3.74 5.25 5.67 
12 95 -92 1.63 2.72 2.34 3.26 4.45 5.80 7.18 8.10 
Double settings (seconds) 

i 2.17 2.28 3.00 3.83 3.65 6.73 7.47 7.89 8.66 11.57 
2 2.18 2.78 4.44 4.54 7.14 8.77 9.36 11.07 13.59 20.60 
3 3.78 3.60 3.93 4.74 6.94 8.91 9.11 13.68 16.13 20.48 
4 1.96 2.36 3.52 4.11 4.39 6.29 7.81 11.31 11.84 19.15 
5 3.34 3.98 6.38 5.89 11.87 8.28 12.16 15.81 14.54 22.01 
6 3.01 4.32 6.27 7.98 9.64 11.88 12.91 12.30 13.58 20.87 
7 2.28 3.11 5.12 8.04 7.59 9.26 10.22 13.28 16.12 19.43 
8 2.40 3.00 4.98 5.46 7.89 10.14 16.42 20.76 23.89 34.68 
9 2.04 2.66 3.38 4.62 6.67 7.97 8.72 10.25 11.15 16.51 
10 1.57 3.04 5.06 5.85 8.11 10.88 13.89 19.33 28.29 28.86 
11 2.16 2.55 4.93 5.04 8.13 11.84 13.57 17.39 16.57 24.01 
12 2.63 5.85 5.87 5.81 12.21 9.93 12.31 15.90 17.85 22.45 


one break at most. Four subjects either 
did not exhibit a break in their data or the 
fit did not improve noticeably on going from 
two to three parameters. For these subjects 
the data were fitted without any break. 

Table 3 gives the exponents, the ratios 
between scale units of the upper over the 
lower segment, and the positions of the 
breaks. The exponents range from .53 to 
1.12. Figure 5 gives the plots. 
Ratio-Setting Experiments 

For all 12 subjects individually, and all 
three experiments, y was plotted against 
(s + Pr), where ®g is the standard and 
&y the variable duration (from the experi- 
menter's point of view). This way of plot- 
ting the data agrees with Model C. Neither 
the short pause of 300 msec nor the reaction 
time of about 300 msec was taken into ac- 
count, These intervals affect only the sub- 
jective zero(s), which are of minor interest 


in this connection. In most cases the plots 
clearly showed whether one, two, or three 
straight lines should be fitted. The fitting 
was done with the method of least squares. 

Denoting #s + Py by 9$, we have, accord- 
ing to Model C, 


r[a(b — 49)? — (by — 9'9*] 


= (by — 4?, (19) 
r 
y = Ene. 


1/8 
PULL 


m 
» r1 


5 Allan and Kristofferson (1974) cite two dura- 
tion discrimination studies where pertormance de- 
teriorated with ISIs shorter than 500 msec. A 
comparison with the present experiments, with an 
ISI of 300 msec, is hardly justified, however, be- 
cause both the tasks and the sets of durations used 
were different. Perhaps it can be concluded that 
Sensory Register 1 was not "switched off" during 
the short pause in the present experiments. 


1/8 
alb + P'o. 
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Setting r/(r + 1) =, for r= sl, and2 
(HS, ES, and DS), we obtain b — 1/3, 1/2, 
and 2/3, respectively. 

For one straight line there were no prob- 
lems. If there were two lines, they were 
fitted independently, The three-line case is 
somewhat more complicated. The two outer 
lines must be parallel, and the three inter- 
cepts are not independent (since there are 
only two segments, there are only two sub- 
jective zeros). Fitting all 10 points simul- 
taneously with four parameters would have 
required the solution of a system of non- 
linear equations. This I did not consider 
worthwhile, Instead the two outer lines 
were fitted together (same slope), and the 
middle line separately, Let us consider the 
cases with one, two, and three straight lines 
one at a time, 

For a single straight line we assume a 
single psychophysical function (no break) 
and obtain 


B = log p/log a, 


according to Equation 15, and 4 from 
Equation 17. 

For two straight lines the main problem 
is which of the two belongs to the same 
segment and which is the transition line be- 
tween the two segments, Consider Figure 
2. In order to have no overlap of the two 
segments of the psychophysical function (ie., 
to have a unique mapping of 4 onto V), the 
highest ordinate of the straight line corre- 
sponding to the transition region (the mid- 
dle line in Figure 2A) must not lie above 
the lowest abscissa of the same straight line 
(see Eisler, Note 2). Usually the data were 
unequivocal; in a few cases other criteria 
had to be invoked. From the line not be- 
longing to the transition region, B and $$ 
were determined as described above; then 
« and do» were computed from the other 
line: 


a = a/b, deo = b' + a/dy, 


or 
Bo = (ba — b')/a’, 
depending on whether the transition line lay 


below or above the other line. (Subscripts 
1 and » for ®o refer to the two segments; 


Subjective duration (decilogs) 


Physical duration (log sec) 


Ficure 5. Log magnitude estimates as functions 
of log stimulus values for 12 subjects. 


slope and intercept for the transition line are 
denoted by '.) 

For three straight lines the exponent was 
determined as above from the slope of the 
outer lines, a, and a as above from the slope 
of the middle line, a’. The intercepts of the 
outer lines yielded the subjective zeros, 

Knowing the exponent £, the ratio be- 
tween the scale units o, and the ds, the psy- 
chophysical function was computed. From the 
parameters of the straight line(s) the theoret- 
ical values were computed. From the latter 
and the corresponding power function the y 
values were computed and, by dividing them 
by p, the yy values. The yy values in their 
turn were entered into the relevant power 
function (same or lower segment) and the 
corresponding d; values were calculated, 
The points in Figures 2, 6, and 7 are these y 
(open circles) and v; (filled circles) values 
plotted against the empirical 4 and Oy, Fi- 
nally, the "stress," a normalized measure of 


446 Hannes EISLER 
TABLE 5 
“EXPERIENCED” VALUES OF } AND 2 
Subject 
Vali 1 2 3 4 5 6 7 8 9 10 n 12 
alue 
65 -76 1.04 48 42 .69 .32 48 52 i87 38 38 
j 187 209 L7] 127 170 243 231 


243 246 1.56 


experiments. 


; i her two 
Note. The exponent from the equal-setting experiment is assumed to be correct and the same as the exponents from the ot 
Note. - 


tween ES and DS is also good, but 
vibus of B between ES and HS is 
low. The latter result is mainly attributable 
to two subjects, 3 and 10. One possible 
interpretation of the discrepancies between 
ES and HS and DS is a misconception by 
a subject about the numbers 1/2 and 2, 
Table 5 gives the "experienced" numbers, 
assuming the exponent from ES to be the 
correct one. However, if a subject miscon- 
strues a number in ratio setting, it is sur- 
prising that he does well in ME, 

The most dependable experiments prob- 
ably are ES and ME, giving average ex- 
ponents of .84 and .85, respectively, though 
with an interindividual variation of .59-1.04 
for ES and .53-1.12 for ME, 

A by-product of these results is the con- 
clusion that—at least for the continuum of 
time—different individual ex 
perceptual differences rather t 
in numerical habits, 


ponents reflect 


han differences 
since ES is number 


Exponent from magnitude estimation 


0.9 1.0 da 


Exponent from fqual-setting 
Ficure 8. Exponents from m 
tion plotted against exponents fr 


agnitude estima- 
for 12 subjects. 


Om equal Setting 


free (cf. Ekman, Hosman, Lindman, Ljung- 
berg, & Akesson, 1968; Jones & Marcus, 
1961; Jones & Woskow, 1962; Rule, 1966, 
1968, 1969; Rule & Markley, 1971; Wans- 
chura & Dawson, 1974). 

To test whether the ratio between € 
units depends on subjects and tasks, an ana 3 
ysis of variance like the previous one was 
carried out with « as the dependent variable 
(for cases with only one segment, œ was set 
at l). The outcome was the reverse of the 
previous one: no significant differences for 
Subjects, but significant differences for tasks 
(P< 01). Pairwise t testing showed that 
the tasks can be Partitioned into two groups 
namely, ES and DS on the one hand, an 
ME and HS on the other (p « .05 for all 
pairs except ES-DS and ME-HS). This 
result can be explained by a centering tend 
ency: When the total physical duration ?? 
comparatively long, as in ES and DS, thé 
Subjective duration is compressed by a scal@ 


1,5 


Exponents from halving and dubling 


1.5 


Exponent from equal-setting 
FIGURE 9, Ex e 
ünd iu Exponents from half setting (sau? 


s) 
f, $ "d 
etting (circles) plotted against exp 


nents from equal setting, 
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unit less than unity for the longest durations 
(the upper segment) ; but for comparatively 
short durations, as in ME and HS, it is ex- 
panded by a scale unit greater than unity for 
the longest durations., 


EXPERIMENTAL RESULTS AND THE 
THREE MODELS 

The three models cannot be distinguished 
on the basis of goodness of fit of the data. 
The fit ultimately depends solely on how 
well the experimental points fit the straight 
line or lines (see Figure 2A), so no statisti- 
cal test can help in choosing. 

The comparison of the three models is 
based on three considerations: (a) the slope 
of the ®y-® plots, (b) the breaks in these 
plots, and (c) the agreement of the expo- 
nents between the four scaling experiments. 


The Slopes of the ®y- Plots 


This section is concerned only with Model 
A and the slopes for which standard and 
variable lie on the same segment of the psy- 
chophysical function. There d, is seen as 
a function of 4; with the following require- 
ments: The slope c for ES has to be unity ; 
for DS it has to exceed unity, and for HS 
it must not exceed unity, since 8 — log r/ 
log c (Equation 13). For ES, r — 1; for 
DS, r — 2; and for HS, r — 1/2; entailing 
c — ] to obtain a finite 8 for ES, and the 
fulfillment of the requirements mentioned for 
DS and HS to obtain 8 > 0. 

Model A taken as it is. For ES the slope 
differs from unity for all subjects except 
three or four; for example, the slope for ES 
for Subject 1, when plotted in this way 
rather than as in Figure 2A, amounts to .45, 
Furthermore, for DS the slopes for Subjects 
1, 3, and 4 are .95, .94, and .91, respectively. 
Thus the simple Model A has to be rejected. 

Model A corrected for the regression ef- 
fect. The regression effect described by 
Stevens and Greenbaum (1966) might al- 
low a correction. If we assume that the 
"true" slopes of ES are unity, the slopes 
obtained may be attributed to this effect. 
This amounts to introducing a new param- 
eter (at variance with the parsimony rule). 
Dividing the slopes for DS by the slopes for 
ES renders all DS slopes > 1. However, if 
the same correction is carried out for the HS 
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data, the slopes for Subjects 2, 3, 6, and 10 
exceed unity (1.21, 1.56, 1.08, and 1.61, 
respectively). Thus, this version of Model 
A also has to be rejected. Assuming dif- 
ferent regression effects for the different 
tasks would, besides adding two more pa- 
rameters, make the parameters unidentifi- 
able, and a model with identifiable param- 
eters (B or C in this case) is to be preferred. 

Model A with different scale units for 
standard and variable. lf we assume that 
the scale units of standard and variable al- 
ways are different, that is, that standard and 
variable always lie on different segments, 
we would have to introduce one more pa- 
rameter for each ratio-setting experiment; 
instead of the two segments in Figure 2B, 
we would have four. As in the corrected 
Model A above, the parameters were un- 
identifiable, since the exponent and the ratio 
of the scale units are linked (see Equation 
13). And in the same manner as for the 
corrected Model A, this would be at vari- 
ance both with the parsimony rule and the 
rule to choose models with identifiable pa- 
rameters. Therefore, Model A is rejected 
(in all three versions). 


The Breaks in the ®y-® Plots 

Consider Figure 2. As mentioned before, 
the discontinuity in the y-® plot is situated 
at 4 sec, entailing a discontinuity at 4 sec for 
the x-axis and at the same value for the y- 
axis in the p-r plot. If the data were 
plotted according to Model A, ®y as a func- 
tion of ®s, the break at the y-axis would still 
be at 4 sec, but the break at the x-axis at 
about 3 sec. Again, Model A is at variance 
with the data. We will not deal further with 
Model A since it has already been rejected ; 
rather we will investigate whether the posi- 
tion of the breaks can be used in deciding 
between Models B and C. 

As pointed out in the section on data anal- 
yses, with ratio settings, in order not to have 
an overlap in the psychophysical function 
(a unique mapping of y onto ©), the highest 
ordinate of the straight line corresponding 
to the transition region (in Figure 2A it is 
the middle line) must not lie above the low- 
est abscissa of the same straight line. 

For Model B the plot corresponding to 
Figure 2A has ®y as ordinate and the total 


Hannes EISLER 
448 


time (standard + variable) Pr as abscissa. 

HS (which is most sensitive to the 
Coh condition) out of the eight. subjects 
showing a break in the psychophysical func- 
tion, seven violated the condition. (For 
Model C, the same condition was violated by 
only one subject.) Therefore, Model C is 
to be preferred to Model B. 


Agreement of Exponents Between the 
Four Scaling Experiments 

We have already seen that, when treating 
the data in accordance with Model C, we 
obtain high correlations (with one excep- 
tion) over subjects between experiments. 
The correlation between ES and ME, the 
most interesting one, amounts to almost A 

This gives further support for the rejec- 
tion of Model A, since for the Possible ver- 
sions of this model the exponent is not iden- 
tifiable. Model C is thus to be preferred to 
Model A not only because of the identifi- 
ability of the exponent as such, but also be- 
cause its value agrees among experiments 
and thus carries real information, : 

When the exponent is computed in ac- 
cordance with Model B, the correlation be- 
tween ES and ME becomes — 9, likewise 

ing the earlier decision to reject 

Model Be Therefore, Models A and B are 
rejected; Model C is retained, 


THE TIME-ORDER ERROR 
The explanation of the time-order error 


(TOE) is a simple consequence of the pres- 
ent model. Rewriting Equation 19 with 6, 


=, (equal settings), 6 = Bs + d, (s 
standard duration), and y = l, we obtain 
a(s + dy — ©) — (bg — ',)8 
= (r — d'a. 

Rearranging yields 
TOE-24$z7—6; 

Clay Pte [fan 1 bs ao 

(2/o)i8 —1 


We see that the TOE depends, both in sign 
and size, on the interaction of the scale units 
and the subjective zeros on the one hand, 
and the standard duration on the other. 


7In a somewhat complicated and farfetched man- 
ner this correlation could be espina aS a result 
of the “regression tendency Stevens, 1959 ; 
Stevens & Greenbaum, 1966). 
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with the variable duration equal to half the 
total duration, For durations exceeding 
1.18 sec (the indifference point) the TOE 
is negative, since the slope of the straight 
line (the uppermost segment) is .4489 (8 = 
.87), and thus smaller than .5. For shorter 
durations the TOE is positive because of the 
positive intercept of .12, corresponding to 
Po = 22. 

It is seen that the TOE for time is a spe- 
cial case. For other continua it is probably 
possible to describe the TOE in terms of 
different subjective zeros, and perhaps dif- 
ferent scale units for standard and variable 
(cf. Eisler & Ottander, 1963). This may 
indicate that it might be more rewarding to 
study the variation of the parameters of the 
power law with experimental conditions than 
to study the TOE directly. 


SuMMARY AND CONCLUSION 


We have seen that Model C copes with 
the problem of time continuing to elapse 
after the presentation of the first duration 
of a pair, as well as with the problem of in- 
sufficient time for coding and transferring to 
LTM the first duration for short ISIs (by 
eliminating any memory), and that Model C 
is capable of accounting for features in dis- 
crimination characteristic for durations only. 
Furthermore, only Model C can account sat- 
isfactorily for the data obtained in the four 
scaling experiments. Note that the same 
process seems to be at work in discrimina- 
tion studies, where usually short durations 
are used, and in scaling experiments with 
comparatively long durations, 

An instance of particular interest is one 
of the experiments carried out by Richards 
(1964), because it covers a range from 2 
sec to 80 min (standards; the smallest re- 
sponse time is 2.57 sec and the largest total 
time 10,400 sec) and involved one subject. 
Here we find two breaks; one at 10 sec and 
the other at 600 sec. Accordingly, points 
4-7 and 10-12 of the 12 data points can be 
used to determine the exponent; the two 
slopes are .574 and .559, respectively, a dif- 
ference of only .015, corresponding to a dif- 
ference of the exponents of .06. 

Data on duration from other investigators 
can be found in Eisler (Note 3). Again, 
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Model C is the only one of the three models 
that yields satisfactory results for ratio- 
setting experiments throughout. The ex- 
ponents obtained agree for the majority of 
these studies with the exponents found in 
the present series of experiments, thus lying 
somewhat below the value of unity given by 
Stevens (e.g., 1967). 

As a result of the data analysis in combi- 
nation with Model C, it proved possible to 
reject psychophysical functions other than 
Stevens’ power law, and to obtain exponents 
for duration from equal setting, that is with- 
out having to rely on a subject's numerical 
behavior. The latter finding not only ob- 
viates objections to direct scales as being 
artifacts of the subject’s numerical habits; it 
also opens up opportunities to obtain ex- 
ponents for duration from animal experi- 
ments in a more convincing way than, for 
example, in the experiment of Platt, Kuch, 
and Bitgood (1973). An experiment of 
equal setting of duration with rats has in 
fact been carried out with the same equip- 
ment and the same stimuli as described 
above; it will be reported in due course. 

The possibility of determining exponents 
from ES data also allows comparisons be- 
tween different modalities, used for present- 
ing the time intervals, in a more precise 
manner. The same goes for comparisons 
between different groups of subjects, though 
remembering that the exponents for the pre- 
sumably normal subjects in the present ES 
experiment varied between .6 and 1.0, the 
group differences found by Carlson and 
Feinberg (1968), for example, between nor- 
mals (8 = 96) and schizophrenics (8 = 
91) seem rather unimpressive. In this 
connection the importance of treating the 
data individually should be emphasized. 

Model C has also made it possible to de- 
scribe the TOE for durations in a parametric 
way; a task for future research may be to 
investigate in detail which factors affect the 
location of the break in the psychophysical 
function for duration and the values of the 
three relevant parameters: the exponent, the 
scale unit, and the subjective zero. 
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experiments, the reader should consult Ikeda 
and Obonai (1955) and Matin (19742) ? 
In the theoretical interpretation of experi- 
ments of this type, it is generally assumed 
that there are neurons that respond to the 
target when it is presented alone and neu- 
rons that respond to the inducer when it is 
presented alone. (The two sets of neurons 
are not necessarily disjoint.) The task for 
theory, then, is to explain why the occur- 
rence of responses in the inducer neurons 
results in a modification of responses in neu- 
rons associated with the percept of the target 
or in some modification of the overall neural 
responding such that the target signal cannot 
be as easily extracted from the noise. This 
way of thinking certainly seems plausible. 
Indeed, at first sight, it seems so compelling 
that it might appear to exhaust the logical 
possibilities. Nonetheless, it is my purpose 
in this article to suggest that reasoning in 
this manner is unnecessarily restrictive. In 
effect, I want to raise the possibility that at 
least in some cases not two, put three classes 
of neurons could be responding to the two 
stimuli presented. 

The three classes to which I am referring 
include the two that are ordinarily consid- 
ered (the T or target neurons and the I or 
inducer neurons) and a third class that re- 
sponds minimally, if at all, to either target 
or inducer alone. Neurons of this type re- 
spond, however, to their combined presenta- 
tion at appropriate temporal intervals. Ac- 
cordingly, they will be called T-I (target- 
inducer) neurons for the general case. When 
the inducer is a mask, the I neurons will be 
called M (mask) neurons, and the T-I neu- 
rons will be called T-M (target-mask) neu- 
rons. Anticipating the argument to be de- 
veloped more carefully below, I simply point 
out for the moment that succession (motion) 
detectors are among the neurons that might 
be found in this third class and that in gen- 
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3 The two-transient paradigm is also employed 
in studies of apparent movement. However, the 
latter studies differ from the other experiments 
being considered in that they are not generally 
concerned with the measurement of changes in the 
sensation associated with one stimulus as the re- 
sult of the presentation of another. Rather, their 
purpose is to study an entirely new sensation, 
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eral both the members of the T-I class stim- 
ulated on a given trial and the magnitude of 
response from the stimulated neurons de- 
pend on the temporal interval between tar- 
get and inducer on that trial. To forestall 
y possible misunderstanding, however, it 
should be noted from the outset that calling 
these neurons motion detectors does not nec- 
essarily imply their involvement in processes 
leading to the sensation of movement, It is 
also important to note that the reference to 
three classes of neurons does not imply ho- 
mogeneity within these classes;* the hice 
classes are not single channels in the sense 
in which that expression has been used i 
the recent literature (Weisstein, O & 
Szoc, 1975; Graham, in press) LPS 
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EXISTENCE OF TARGET-MASK NEURONS 


Potentially, several types of neurons could 
fall into the T-M class in a lateral masking 
experiment. Cells with large receptive fields, 
for example, might not be optimally stimu- 
lated by either target or mask alone, but 
might respond favorably to their combined 
presentation. However, the most convincing 
evidence for the existence of T-M neurons 
is found in the literature on motion detec- 
tors. There it has been shown, for a con- 
siderable variety of animals, that successive 
stimulation of neighboring retinal regions, 
at appropriate temporal and spatial separa- 
tions, is an adequate stimulus for a motion 
detector. These animals include several in- 
sects (McCann, 1973; Reichardt, 1961, 
1969), rabbits (Barlow & Levick, 1965), 
and frogs (Grüsser-Cornehls, 1968; Grüs- 
ser-Cornehls, Grüsser, & Bullock, 1963). 
To my knowledge, no attempt has been made 
to study this matter in primates. However, 
it seems plausible to suggest that the motion 
detectors that have been found in that order 
of animals (Hubel & Wiesel, 1968; Wurtz, 
1969a, 1969c) could be shown to be succes- 
sion detectors with appropriate experiments. 
Then, if the human does not differ from 
other primates in this regard, it could be 
assumed that neurons exist in the human 
visual system, possibly at a variety of neural 
levels, which are specialized for the detec- 
tion of relative motion between retina and 
environment but which also respond to the 
target-mask stimulus sequence in a lateral 
masking experiment. 

The reader should note carefully, how- 
ever, that relative motion between retina 
and environment does not necessarily imply 
the movement of an object; such motion 


could equally well arise from movements by * 


the observer and, in particular, from the 
movements of his eyes. Indeed, beginning 
with an important series of studies in the 
striate cortex of the awake, unanesthetized 
key (Wurtz, 1968, 19692, 1969b), a 
rapidly accumulating body of evidence has 
become available which shows that informa- 
+. about the motion of the retinal image 
oor saccadic eye movements, including 
men yelocity components, is processed by 
its 


mon 


the visual system (Adey & Noda, 1973; 
Brooks & Holden, 1973; Hoffmann, 1973; 
Noda, 1975; Noda & Adey, 1974; Noda, 
Freeman, & Creutzfeldt, 1972; Noda, Free- 
man, Gies, & Creutzfeldt, 1971; Wurtz & 
Goldberg, 1972). 

In addition to the studies cited in the pre- 
ceding paragraph, which come specifically 
from the literature on saccadic eye move- 
ments, there are scattered references in the 
general neurophysiological literature to the 
existence at various neural levels of cells 
that respond to velocities greater than 100° 
of visual angle per sec. Such references can 
be found in studies specifically concerned 
with neurons responding to movement (e.g., 
Oyster, Takahashi, & Collewijn, 1972, Fig. 
6). They can also be found in the literature 
on W, X and Y cells, in which it has been 
shown that some of the Y cells, but no X 
cells, respond to very high velocities of 
image motion (e.g. Cleland, Dubin, & Le- 
vick, 1971, p. 481; Cleland, Levick, & Sand- 
erson, 1973, p. 652; Hughes & Whitteridge, 
1973, p. 1105; Singer & Bedworth, 1973, pp. 
296-297). It seems unlikely that cells re- 
sponding to very high velocities are involved 
in the sensation of motion, because, at least 
in man, such sensations are only elicited by 
slow movements. However, a large range 
of velocities, including some very high ones, 
are quite routinely generated on the retina 
during saccadic eye movements. Accord- 
ingly, it seems reasonable to suggest that 
those broadly tuned Y cells which respond 
to high velocities of image motion are a 
part of the neural apparatus that is process- 
ing information related to saccadic eye move- 
ments or other rapid bodily movements, with 
the more peripheral cells of this type feed- 
ing into central cells sensitive to high veloci- 
ties. As will be seen shortly, an important 
role will be assigned to these neurons in 


6 The X-Y cell dichotomy was introduced by 
Enroth-Cugell and Robson (1966). These two cell 
types are sometimes also referred to as sustained- 
transient (Cleland, Dubin, & Levick, 1971) and 
Type II, Type I (Fukada, 1971). More recently 
a third category (W) has been introduced (Stone 
& Hoffmann, 1972). 
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EXISTENCE OF TARGET-MASK NEURONS 


Potentially, several types of neurons could 
fall into the T-M class in a lateral masking 
experiment. Cells with large receptive fields, 
for example, might not be optimally stimu- 
lated by either target or mask alone, but 
might respond favorably to their combined 
presentation. However, the most convincing 
evidence for the existence of T-M neurons 
is found in the literature on motion detec- 
tors. There it has been shown, for a con- 
siderable variety of animals, that successive 
stimulation of neighboring retinal regions, 
at appropriate temporal and spatial separa- 
tions, is an adequate stimulus for a motion 
detector. These animals include several in- 
sects (McCann, 1973; Reichardt, 1961, 
1969), rabbits (Barlow & Levick, 1965), 
and frogs (Grüsser-Cornehls, 1968; Grüs- 
ser-Cornehls, Grüsser, & Bullock, 1963). 
To my knowledge, no attempt has been made 
to study this matter in primates. However, 
it seems plausible to suggest that the motion 
detectors that have been found in that order 
of animals (Hubel & Wiesel, 1968; Wurtz, 
1969a, 1969c) could be shown to be succes- 
sion detectors with appropriate experiments. 
Then, if the human does not differ from 
other primates in this regard, it could be 
assumed that neurons exist in the human 
visual system, possibly at a variety of neural 
levels, which are specialized for the detec- 
tion of relative motion between retina and 
environment but which also respond to the 
target-mask stimulus sequence in a lateral 
masking experiment. 

The reader should note carefully, how- 
ever, that relative motion between retina 
and environment does not necessarily imply 
the movement of an object; such motion 
could equally well arise from movements by 
the observer and, in particular, from the 
movements of his eyes. Indeed, beginning 
with an important series of studies in the 
striate cortex of the awake, unanesthetized 
monkey (Wurtz, 1968, 1969a, 1969b), a 
rapidly accumulating body of evidence has 
become available which shows that informa- 
tion about the motion of the retinal image 
during saccadic eye movements, including 
its high velocity components, is processed by 


the visual system (Adey & Noda, 1973; 
Brooks & Holden, 1973; Hoffmann, 1973; 
Noda, 1975; Noda & Adey, 1974; Noda, 
Freeman, & Creutzfeldt, 1972; Noda, Free- 
man, Gies, & Creutzfeldt, 1971; Wurtz & 
Goldberg, 1972). 

In addition to the studies cited in the pre- 
ceding paragraph, which come specifically 
from the literature on saccadic eye move- 
ments, there are scattered references in the 
general neurophysiological literature to the 
existence at various neural levels of cells 
that respond to velocities greater than 100° 
of visual angle per sec. Such references can 
be found in studies specifically concerned 
with neurons responding to movement les; 
Oyster, Takahashi, & Collewijn, 1972, Fig. 
6). They can also be found in the literature 
on W, X and Y cells? in which it has been 
shown that some of the Y cells, but no X 
cells, respond to very high velocities of 
image motion (e.g., Cleland, Dubin, & Le- 
vick, 1971, p. 481; Cleland, Levick, & Sand- 
erson, 1973, p. 652; Hughes & Whitteridge, 
1973, p. 1105; Singer & Bedworth, 1973, pp. 
296-297). It seems unlikely that cells re- 
sponding to very high velocities are involved 
in the sensation of motion, because, at least 
in man, such sensations are only elicited by 
slow movements. However, a large range 
of velocities, including some very high ones, 
are quite routinely generated on the retina 
during saccadic eye movements. Accord- 
ingly, it seems reasonable to suggest that 
those broadly tuned Y cells which respond 
to high velocities of image motion are a 
part of the neural apparatus that is process- 
ing information related to saccadic eye move- 
ments or other rapid bodily movements, with 
the more peripheral cells of this type feed- 
ing into central cells sensitive to high veloci- 
ties. As will be seen shortly, an important 
role will be assigned to these neurons in 


9 The X-Y cell dichotomy was introduced by 
Enroth-Cugell and Robson (1966). These two cell 
types are sometimes also referred to as sustained- 
transient (Cleland, Dubin, & Levick, 1971) and 
Type II, Type I (Fukada, 1971). More recently, 
a third category (W) has been introduced (Stone 
& Hoffmann, 1972). 
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the suggestions about lateral masking that 
are outlined below.* 


MASKING as A JoixT RESULT or Mask 
AND TancET-MasK NEURONS 


If the assumption that T-M neurons are 
stimulated in a lateral masking experiment 
is correct, we are coníronted with three 
logically possible alternatives : (a) The re- 
sponses of the T-M neurons could be irrele- 
vant to the masking effect, which would be 
due entirely to the M neurons; (b) all T-M 
neurons could contribute to masking; or 
(c) some, but not all T-M neurons could 
contribute to it. None of these alternatives 
can be accepted or rejected out of hand. 
However, I choose the third for further ex- 
ploration and, in effect, follow earlier sug- 
gestions that some motion detectors could be 
involved in the sensation of movement and 
others in saccadic suppression and visual 
masking (Matin, 1974b, in press). The 
discussion begins with a brief summary of 
that information about lateral masking which 
is necessary for the Subsequent analysis, 


Some Pertinent Facts 


When lateral masking is studied, the back- 
ward temporal intervals (target precedes 
mask) are called metacontrast and the fore- 
ward intervals paracontrast. In general, and 
a priori perhaps somewhat surprisingly, 
there is a gross asymmetry in magnitude of 
effect in these two cases. Whereas para- 
contrast is usually a relatively weak phe- 
nomenon, metacontrast can be very power- 
ful: Under appropriate experimental condi- 
tions, stimuli that are clearly visible when 
presented alone can undergo marked de. 
creases in brightness or can even be sup- 
pressed entirely (Alpern, 1953; Fry, 1934. 

7 Implicit in this discussion is the assumption 
that all motion-sensitive cells, including the Y cells, 
are succession detectors. Although this assumption 
is plausible, it goes beyond the established facts 
and it would certainly be desirable to study more 
kinds of motion-sensitive cells „with stimuli de- 
livered successively to neighboring retinal areas, 
Note also that in this paper the terms motion de- 
tectors and motion-sensitive cells are used very 
loosely to indicate cells that are sensitive to move- 
ment. These neurons might include the more 
stringently defined direction-specific cells consid- 
ered by some authors (e.g. Barlow & Hill, 1963), 
but they might also include other types of cells. 


ETHEL MATIN 


Schiller & Smith, 
Frequently, along with 
brightness, subjects report an apparent 
movement of target toward mask 
1966 ; Kahneman, 1967 ; 


1966 ; Werner, 1935). 


the changes in 


metacontrast is 4 q 
tion of time betw 


and the onset of the mask. 

masking js Maximal when th 

cedes the mask by some int 

gaan temporal functions 
“Shaped. Depending 4... 

l Ng again or ic 
lar conditions i ps eue 
en eR erval can 
wal Won few msec to more than 100 

. "hen the observer. is th 

e. €r s task is 
ognition or detection of the t tae 
considerable controversy 
function: ` 


een the onset of the target 


Under others, 
e target pre- 
erval; the ob- 


shape of 
imenters 


shaped when a 
threshold ; 


: ader lcal point of 
masking terat who 1s unfamiliar with the 
ook the Proviso | might be inclined to over. 
be of gi; ilar i that mask and target must 
misled in think fe, therefore could be 
Hn f RUE that a simple explana 
in the ten Ped functions could be found 
might seem dos rend elect. That is i 
* Those wi au a high-energy mask will 
Se wj 10 á à 
literatur, Ta Dt famili i 
r : $ ar wit as 
onic function advised that the ex : xia ees 
ackvand RA = TS only ibd fps 3nono- 
Monoton c T ing alone, o máskis "Rusking or 
Dora] inter i the entire Possible E function is 
9 Bor ls studied j ange of tem- 
the 
tion jg pote af thi: diac 
m S dis s " 
Ments ES between skin, ussion, no distinc- 
detection P iEhtness and maskin. involves decre- 
adviseq tha timents OWeye 5 OLD Studied in 
ered, Consult ph à distincti n sh (reader is 
Becker "2 À Bric Beman (1975 Uld be consid- 
stein pi aD Hoffm; ? D: 96), Eriks 
tin (19 an (19 » Erikson 
72, p 2 P. 245 ] 
P. 235). ), and Weis- 


TRANSIENT PARADIGM 455 


produce a neural response at a shorter la- 
tency than a target of lower energy and that 
this difference in latency could account for 
the apparently retroactive effect of mask on 
target. I therefore emphasize that the facts 
do not support such an interpretation: If 
mask and target energies are held fixed 
throughout the range of temporal intervals 
explored, the temporal functions are U- 
shaped when mask and target are of similar 
energy and monotonic when the mask is 
considerably greater in energy than the tar- 
get (Kolers, 1962; Weisstein, 1968, 1972). 
On the other hand, a variety of other ex- 
planations have been proposed for U-shaped 
functions (Breitmeyer, Love, & Wepman, 
1974; Bridgeman, 1971, 1975; Fehrer, 1965, 
1966; Kahneman, 1967; Mayzner, Tresselt, 
& Helfer, 1967; Weisstein, 1968, 1972; 
Weisstein et al., 1975; Werner, 1935). As 
will be seen in subsequent paragraphs, the 
present article will produce an addition to 
this already long list of suggestions. 


Apparent Movement and. Metacontrast 


As was noted previously, several experi- 
menters have commented on a correlation 
between apparent movement and masking 
in metacontrast experiments ( Biederman- 
Thorson, Thorson, & Lange, 1971; Breit- 
meyer, Love, & Wepman, 1974; Fehrer, 
1965, 1966; Fehrer & Raab, 1962; Kahne- 
man, 1967; Kolers & Rosner, 1960; Toch, 
1956). In addition, a theory has been de- 
veloped based on the assumption that this 
correlation’ is causal (Kahneman, 1967). 
According to this theory, metacontrast is a 
case of impossible apparent movement. "Thus, 
in the frequently employed display with 
three rectangles (a central one as target and 
two flanking ones as a mask), a single ob- 
ject is set into apparent motion in two di- 
rections—a physically impossible movement, 
Similarly, in the disc-ring display, the stim- 
ulus sequence presented would cause the 
original disc to grow larger even as it dis- 
appears. Kahneman suggested that suppres- 
sion of the target occurs because the per- 
ceptual system is unable to resolve these 
contradictions. He supported his conten- 
tion of a causal relation between masking 
and apparent movement with an experiment 
in which these two phenomena were studied 


with the same subjects and similar displays. 
Under his conditions, the temporal func- 
tions for apparent movement and metacon- 
trast were virtually identical. 

Although there are many difficulties with 
Kahneman's theory, it might have been pos- 
sible to reformulate it in a satisfactory way 
if further evidence had substantiated the 
high correlation between temporal functions 
for apparent movement and masking that it 
implies. This, however, was not the case. 
It was shown, for example, that the resem- 
blance between them breaks down when vis- 
ual angle, luminance, and duration are ex- 
plored over a sufficient range (Weisstein & 
Growney, 1969). Kolers also emphasized 
that the two phenomena are different in 
basic ways. Among other differences, he 
noted that masking falls off sharply as dis- 
tance between target and mask is increased, 
but apparent movement does not, and that 
masking is markedly sensitive to similarity 
of features in target and mask, but apparent 
movement is not (Kolers, 1972, pp. 131- 
132). 

Clearly, then, there is no causal relation 
between apparent movement and masking. 
Nonetheless, the intriguing correlation. be- 
tween the two effects remains and needs to 
be accounted for. The T-M neurons, I 
think, offer considerable promise toward that 
end. Recall the previously cited evidence 
which shows that at least some movement 
detectors respond to velocities well above 
those eliciting the sensation of motion in 
man. As was noted, such velocities would 
normally be produced in the course of sac- 
cadic eye movements. Let us suppose, then, 
that a variety of broadly tuned neurons 
exist, which detect relative motion between 
retina and environment. (Such neurons 
are not, of course, first-order neurons, nor 
need they be retinal neurons at all. More- 
over, there is no reason to assume that thev 
exist at only one neural level.) 1 y 


Some of 


10 The cells referred to herein as target neurons 
and those referred to as mask neurons include 
first-order neurons and various other retinal and 
central neurons, presumably organized in some 
hierarchal manner as the visual message proceeds 
from the periphery through the central nervous 
system. On the other hand, because the types of 
T-M cells being considered in this 
motion detectors, 


paper consist of 
they do not include first-order 
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ight also be helpful in resolving the con- 
ae y in the masking literature about the 
A v U-shaped functions when the 
RA task is detection or recognition of 
T (see Kahneman, 1968, and Lefton, 
1972, for references). In experiments x 
this type, stimuli employed as targets tene 
to be weak. Therefore, differences in ex- 
perimental conditions that might appear 
slight and apparently insignificant could re- 
sult in a firing of T-M neurons In some 
studies but not in others, and therefore to 
reports of U-shaped functions i 
ies but not in others 
419), 


n some stud- 
(cf. Kahneman, 1968, p. 


Paracontrast 


Because responses in the T-M neurons 
are elicited by Successive stimulation of 
neighboring retina] areas, it could be ex- 
pected that firing would occur in these ney- 
rons to the paracontrast Sequence as well 
the metacontrast Sequence, At first sight, 
therefore, it might seem that the suggestions 
being considered would lead to a prediction 
of masking functions that are symmetrical 
around the origin (simultaneous presenta- 
tion of. target and mask). Such a predic- 
tion, of course, would be Contrary to fact, 
In those cases where Metacontrast js U- 
shaped, paracontrast is much weaker and 
does not extend as far fr 


om the origin (Al- 
pern, 1953; Kolers & Rosner, 1960) 


A more careful look at the situ 
ever, shows that masking as a joint result of 
the M and T-M neurons implies Stimulus 
asymmetries between the forward and back- 
ward temporal intervals that do Not exist for 
the mask stimulus alone, that is, stimulus 
asymmetries other than the mere fact that 
the target is flashed before the mask in meta. 
contrast and the mask before the target in 
paracontrast, A fundamental temporal and 


that are called U-shaped), 
siderations suggest that expi 
sensitivity would show peaks 
at zero. 
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a secondary spatial asymmetry for the ve 
neurons are described in the next ies i" - 
graphs. This description is ig +. 
further discussion showing that a p m iih 
explanation for the weakness of n e 
relative to metacontrast can be derived ro ; 
the fundamental asymmetry. In addition, ; 
will raise the possibility that the secondary 
asymmetry might also contribute toa weaker 
paracontrast. The latter possibility, € 
ever, is by no means crucial to the overa 
argument, . : 
Consider again what happens in a meta- 
contrast experiment, The target is pre- 
sented, followed by the mask, Accordingly, 
the target-mask stimulus sequence is cami- 
pleted when the mask is presented, For 
paracontrast, however, the completion of the 
target-mask sequence coincides 
with the presentation of the target, not the 
mask. In a sense, therefore, it might be 
said that the target-mask stimulus follows 
the target in metacontrast but is simulta- 
neous with the target in paracontrast. 
€ asymmetry between 1 
Paracontrast for the T-M neurons described 
in the Preceding Paragraph is inherent to the 
Masking paradigm Per se and independent 


of the Particular stimuli that are chosen as 
target and mask, In addition, 


other secondary stimulus 
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consists of the single central rectangle, fol- 
lowed by two flanking ones; for paracon- 
trast, the converse is true. I know of no 
convincing a priori reason to expect the 
paracontrast sequence to be less favorable 
for the stimulation of T-M neurons than the 
metacontrast sequence. Nonetheless, this 
possibility exists and should be explored. 
In any event, it seems extremely unlikely 
that the effect of the two different sequences 
could be identical, and it would clearly be 
desirable to perform quantitative studies with 
a simpler type of stimulus configuration 
without this asymmetry," that is, studies 
with two identical stimuli, one the mask, the 
other the target, located in such a manner 
that they fall on retinal tissues of similar 
sensitivity. In particular, a study with tar- 
get and mask consisting of two parallel slits 
would be useful. In addition to eliminating 
the spatial differences between metacontrast 
and paracontrast for the T-M neurons, the 
use of such slits would have an additional 
advantage: It might tend to generate data 
that would facilitate an analysis of the sug- 
gested relation between masking and mo- 
tion detectors because moving slits are fre- 
quently employed in the study of the latter. 

Although the secondary stimulus asym- 
metry is removable by an appropriate choice 
of stimuli, the other, more fundamental 
asymmetry is not. Differences between 
metacontrast and paracontrast would there- 
fore be expected even in experiments with 
simpler spatial configurations, Moreover, 
the considerations about the rapid conduc- 
tion velocity of the T-M. neurons relative to 
the T neurons that were raised in the pre- 
vious discussion of metacontrast lead to the 
expectation that these differences would be 
in the direction of a weaker paracontrast. 
For the latter, the stimulus for the T-M 
neurons is completed when the target is pre- 
sented. Therefore, if the T-M neurons con- 
duct more rapidly than the T neurons, their 
effect might be at least partially dissipated 
by the time the responses of the T neurons 
arrive at the site of interaction. (Compare 

16 Such stimuli have occasionally been used in 
the masking literature (e.g., Stigler, 1910, who de- 
scribed masking experiments in which the target 
was half of a disc and the mask the other half). 
To my knowledge, however, no quantitative data 
are available on this question. 
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this with the situation in metacontrast, for 
which the more rapid conduction velocity of 
the T-M neurons works in the direction of 
an increase in masking, because it gives the 
responses of the T-M neurons an oppor- 
tunity to overtake the responses in the T 
neurons which were stimulated earlier.) In- 
deed, if the difference in conduction velocity 
between the T and the T-M neurons were 
great enough, the T neurons in a paracon- 
trast experiment could escape the masking 
effect of the T-M neurons entirely, The 
paracontrast effect would then be the result 
of the M neurons alone. In either case, 
whether the escape is partial or complete, a 
paracontrast that is weaker than metacontrast 
would be predicted. 


CONCLUDING REMARKS 


As was shown in the preceding discus- 
sion, a considerable amount of information 
has become available in recent years about 
neurons in the visual nervous system that 
are responsive to rapid displacements of the 
retinal image. This information forms the 
empirical basis for the speculations about 
lateral masking in this article. In essence, 
I have raised the possibility that broadly 
tuned motion detectors, capable of respond- 
ing to high velocities of image motion, are 
being tapped by the target-mask stimulus 
sequence in a lateral masking experiment, 
and I have explored a few of the implica- 
tions of that assumption. The suggestions 
presented are not, and were not meant to be, 
a complete model of lateral masking. For 
example, the argument developed explicitly 
attributes some of the overall masking effect 
to neurons stimulated by the mask (ie., to 
those neurons that are entirely responsible 
for the masking effect in other theoretical 
treatments). However, almost no attempt 
to examine the possible effects of these mask 
neurons was made." Nor does the paper 
even include a complete exploration of the 


17 Nothing that was said in this article per se 
precludes any of the various masking mechanisms 
that have been previously proposed. For example, 
masking by the target-mask neurons could exist 
in addition to the masking of the X cells stimu- 
lated by the target by the Y cells stimulated by 
the mask, as has been suggested in recent theo- 
retical treatments of metacontrast (Breitmeyer & 
Ganz, in press; Breitmeyer, Love, & Wepman, 
1974; Weisstein ct al., 1975). 
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the psychological magnitude.” However, 
Pollack (1965, p. 565) speaks of the poten- 
tial of the iterative technique for producing 
"unbiased psychophysical functions,” and 
Montgomery and Eisler (1974, p. 441) treat 
the result as an “equal interval scale.” To 
clear up these discrepant interpretations, it 
is necessary to determine what is assumed 
when using the iterative technique and what 
it does. 

The present article has three interrelated 
goals. The first is to provide a theoretical 
foundation for Stevens’ iterative scaling pro- 
cedure. That will be done using a mix- 
ture of ideas from integration theory and 
functional measurement (Anderson, 1974a, 
1974d) and from Parducci’s (1965, 1974) 
range-frequency theory of category judg- 
ment. 

The second goal is to contribute to the 
general theory of category judgments that 
stems from Parducci’s range-frequency the- 
ory. These first two goals are the subject 
of the first part of this article. 

The third goal is to present a general 
perspective on context effects. Instead of 
being treated as undesirable biases, context 
effects can be incorporated into substantive 
theory. This perspective, illustrated by the 
analysis of the distribution effect, is seen to 
have wide relevance in the second part of 
this article. By taking a substantive atti- 
tude toward context effects, functional mea- 
surement is able to provide a unified ap- 
proach to many traditional problems in psy- 
chophysics. E 


'THEORETICAL ANALYSIS 
Notation 


'The notation for the subsequent theoreti- 
cal analysis is simple, but it will be helpful 
to summarize it here. In the experimental 
task under consideration, the subject is pre- 
sented one stimulus at a time and estimates 
the magnitude of some quality of that stim- 
ulus, such as the loudness or pitch of a tone. 
The set or distribution of stimuli presented to 
the subject is assumed to have a physical met- 
ric. For convenience, the same symbol S will 
be used to denote the stimulus and its physical 
value. These values are taken to lie between 


S, and Sy, the lower and upper bounds of 
the stimulus distribution. 

Three functions of S will be needed. The 
first is the response function R(S); it de- 
notes the theoretical or the mean response 
to S. The second is the cumulative fre- 
quency function F(S); it can be treated as 
the percentile rank of S. The third and last 
is the psychophysical function V*(S); it 
represents the perceptual magnitude of S. 

These three functions will be represented 
in a common, normalized, percentile form. 
Thus, F(S)will be measured as the ordinary 
frequency percentile, with values between O 
and 100. Correspondingly, the values of 
R(S) will arbitrarily be taken to lie on a 
0-100 scale. To represent the psychophysi- 
cal function in an analogous percentile form 
requires a linear transformation on V*(S) 
to obtain the percentile form V (S): 


V(S) = 100[7* (S) — V*(51)]/ 
[V*(Sg) — V*(Si)]. (1) 


In this expression the denominator is the 
subjective stimulus range, and the numera- 
tor is the perceptual value of S relative to 
a zero point at the lower end of the range. 
Thus defined, V (S) is a linear transforma- 
tion of V*(S) and may also, for present 
purposes, be treated as the psychophysical 
function. 

For expositional purposes, it will often 
be convenient to speak in terms of discrete 
stimulus distributions and discrete response 
categories. In the theoretical analyses, how- 
ever, it will be assumed that the response 
scale is effectively continuous, a graphic 
rating scale, for example. Also, the distri- 
bution of stimuli presented for judgment 
will be assumed to be effectively continuous 
so that, in particular, R(S) is defined for 
all S. These continuous distributions can 
be viewed as approximations to the histo- 
gram forms of the discrete distributions, or 
vice versa. In the iterative technique itself, 
each stimulus is ordinarily presented equally 
often, so each histogram component would 
have constant area, varying in height to re- 
flect the interstimulus distances. 


Subscript 2 will denote the nth iteration, 
and subcript o» will denote the limit of the 
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the frequency function.* Note that this re- 
sult does not define the shape of the response 
function ; that depends on the physical metric 
of S. 

The rationale leading to Equation 3 is es- 
sentially a formalization oí the iterative tech- 
nique as it has been presented in the litera- 
ture. This formalization makes clear the 
aim and nature of the iterative technique, 
but it also brings out some serious limita- 
tions. First is the lack of any proof that 
the iterative technique converges. The pre- 
vious reliance on experimental demonstra- 
tions of convergence is sensible in itself, but 


? A curious aspect of the iterative technique is 
that the response categories will not in general be 
used equally often in the limit, contrary to asser- 
tions by Stevens (1957) and Stevens and Galanter 
(1957). They give no basis for this assertion, but 
presumably it was thought to be required by the 
claims made for the iterative technique. If indeed 
the iterative technique does produce a stimulus dis- 
tribution that neutralizes potential bias caused by 
the tendency to use the response categories equally 
often, then it would seem that such neutralization 
would require equal category usage. However, 
that will be true only under special conditions. 

If response variability was zero, then each re- 
sponse would be used equally oíten in the limit. 
Equation 3 implies that a unit increment in stimu- 
lus frequency corresponds to a unit increment on 
the response magnitude scale, and so also, given 


zero response variability, to a unit increment in 
response frequency, 


However, if response variability is 
then the responses that actually fall w 
interval on the response scale will ir 
strays from adjacent intervals. 
sponses that stray out of each inte. 
those that stray in, which is not 
expected, the response categories v 
with equal frequency. 

This point was first made b 
Eisler (1974), who also seem to be the first to 
report relevant data. In fact, their response fre- 
quencies had a U-shape, with more responses in the 
two end categories and fewer in the middle. They 
note that the response variability was very low 
for the two end stimuli, which were, accordingly, 
almost always assigned to the two end response 
categories, very seldom to the adjacent categories, 
However, response variability was markedly higher 
for the adjacent stimuli, which were assigned to 
the two end response categories with appreciable 
frequency. Thus the two end categories gained 
in frequency of usage at the expense of the ad- 
jacent categories. The loss suffered by the latter 
would in turn propagate inward to the middle 
categories, augmented perhaps by further unequal 
response variability. 
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that leaves the equal-írequency assumption 
untested and its role unclear. This assump- 
tion seems crucial, yet the iterative tech- 
nique seems rather to impose it than to test 
it. If convergence can occur with unequal- 
irequency tendencies, then the iterative 
technique may not only fail to neutralize the 
one bias, but also may introduce another 
bias of its own. 

No less serious is the fact that the itera- 
tive technique provides no substantive inter- 
pretation of the limit response function 
RZ(S). The disagreement among the three 
quotations above reflects this shortcoming. 

Both limitations of the iterative technique 
stem from the lack of any underlying theory. 
To give meaning to the iterative technique, 
therefore, it is necessary to have recourse to 
some substantive theory. Parducci's range- 
frequency theory, which will be employed 
here, has the great advantage that it can be 
generalized to provide an analytical treat- 
ment even when the equal-frequency as- 
sumption fails, 


Range-Frequency Theory 


In Parducci’s (1965, 1974) theory of cate- 
gory judgment, the overt response is a com- 
promise between two covert response tend- 
encies or values. These are the range value 
and the frequency value. The range value 
can be represented as the range percentile 
V(S) of Equation 1; it is just the percep- 
tual magnitude of S, expressed in terms of 
its relative location on the subjective scale 
between the two end stimuli. For the time 
being, it is assumed that the subject tends 
to use each response category equally often, 
as with the iterative technique above. Ac- 
cordingly, the frequency value will b 
resented in cumulative form as the ordinary 
frequency percentile, This definition of the 
frequency value exactly parallels that of the 
Tange value, 

In general, the Tange tendency and the 
frequency tendency will disagree about the 
response to any given stimulus. Responses 
controlled by the equal-frequency tendency 
alone would produce a uniform response 
distribution, each response being used 
equally often regardless of the actual distri- 
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bution of stimulus values. Responses zm 
trolled by the range tendency alone gor 
exactly mirror the (subjective) stimulus 
distribution. - ' 

The actual response to any given stimulus 
is assumed to be a compromise between the 
range tendency and the frequency tendency. 
Mathematically, this range-frequency com- 


promise is assumed to follow an averaging 
model: 


R(S) 2wV(S) + (1—w)F(S) + te 


where w is a relative weight, 0 < w < 1, and 
e(S) represents response variability. The 
Tesponse variability is assumed to have zero 
mean for each 5 and can be ignored in what 
follows. Accordingly, R(S) will refer to 
the mean value of the response, 


A basic assumption in range-frequency 
theory is that the range value is independent 
of the stimulus distribution. This assump- 
tion does not disallow contrast effects, how- 
ever, because the zero of the perceptual scale 
is lost in the percentile representation of 
Equation l. Thus, range-frequency theory 
is not inconsistent with an interpretation of 
distribution effects as changes in perceptual 
value proper, though it does require V*(S) 
itself to be linearly related across different 
distributions, 


It should be emphasized that the present 
interpretation of the range value as the basic 
perceptual stimulus value is an assu 
Parducci (1974, p. 137) prefers a nominal 
interpretation as a model parameter, in part 
because of a proper concern that the range 
value must yet be shown to have cross-task 
generality before it can be considered as a 
basic scale value. Applications of functiona] 
measurement in other tasks (e.g., Anderson, 
1972a, 1974c; Weiss, 1975) have Supported 
cross-task invariance, and have illustrated 
how invariance can provide a powerful tool 
for analyzing the structure of perception and 
judgment. This matter deserves systematic 
study within range-frequency theory, espe- 
cially because of its bearing on the Nature of 
the rating process. 


mption, 


Range-Frequency Analysis of the Iterative 
Technique 


The range-frequency model of Equation 4 


can be applied directly to represent the it- 
erative technique : 


Ra (S) = wV (S) + (1— w)F,CS). (5) 


The iteration index 1 is omitted from V(S) 


since it is assumed to be independent of the 


stimulus distribution. From Equations 5 
and 2, 


R,(S) = wV (S) + (1 — w)R, (S). 
(6) 

This difference equation converges under 
quite general mathematical conditions. For- 


mally, the solution can be obtained by setting 


Ri(S) = Raa (S) = R.(S), and solving to 
obtain 


Equation 7 sa 
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Comments on Assumptions 


Basic to the above analysis is the equal- 
frequency assumption, that all responses 
tend to be used equally often. This assump- 
tion will not be correct in general. As an 
extreme example, one would not ordinarily 
expect the two response categories, tiny and 
larger than tiny, to be used equally often in 
judgments of size. 

But even for an ordinary rating scale, with 
categories ranging from, say, very very small 
to very very large, a reasonable expectation 
would be for a normal-like distribution of 
usage. The normal distribution has ample 
basis in daily experience, in which the rough 
rule is that most events are near average in 
magnitude and extreme events are infre- 
quent. In contrast, the flat, equal-frequency 
distribution is rare in daily experience. 

Clearly, the equal-frequency assumption 
demands special attention. It may turn out 
to be correct, but it requires better empirical 
support than is currently available. 

A second basic assumption is that the 
range-frequency model represented by Equa- 
tion 4 is correct. In support of this assump- 
tion, Parducci's theory has had substantial 
success. The range-frequency model has 
been dramatically superior to  Helson's 
(1964) adaptation-level theory of category 
judgments in various qualitative tests ( Par- 
ducci & Perrett, 1971). Although the pub- 
lished quantitative tests present a cloudy 
picture, the model does have considerable 
potential. 

There is a final basic assumption, one that 
has a somewhat different conceptual status 
and function. This is the assumption that 
the overt response is an interval scale. But 
the iterative technique will converge even 
with an ordinal response scale; the assump- 
tion that the overt response is an interval 
scale serves only the one purpose of inter- 
preting the limit response function as the 
psychophysical function. This assumption is 
not accepted by all, as shown by the quoted 
passages above, and its nature and status re- 
quire a more detailed discussion. That dis- 
cussion is deferred until the second part of 


this paper. 


Generalized Range-Frequency Model 


The equal-frequency assumption is basic 
to the iterative technique, and it has been 
standard in nearly all applications of range- 
frequency theory. However, there is rea- 
son to suspect the equal-frequency assump- 
tion and rather little solid evidence to sup- 
port it. Accordingly, it is desirable to 
reconsider the above analyses while allowing 
for tendencies to use the response categories 
with unequal frequency. This section con- 
siders the generalization of the range-fre- 
quency model. The next section shows how 
the iterative technique suffers under the un- 
equal-frequency assumption. The following 
section will present exact tests of the range- 
frequency model. 

To handle the unequal-frequency assump- 
tion, it is useful to begin by stating a very 
general version of the range-frequency 
model in which the overt response is an 
average of a generalized range value YS) 


and a generalized frequency value F(S). 
Thus, 


R(S) =wV(S) + A-w)F(S). (8) 


No assumption is made about the form of 
V or F except that they be expressed in the 
percentile form indicated above. 

The essential properties of this generalized 
range-frequency model are additivity and 
independence. The generalized range value 
is assumed to be independent of the distri- 
bution of stimuli, and the generalized fre- 
quency value is assumed to be independent 
of the stimulus magnitudes. These proper- 
ties alone are sufficient to provide a test of 
the model, and that is illustrated in Array 
17 below. Here it is necessary to consider 
how the unequal-frequency assumption af- 
fects the specification of P and Ñ. 

Parducci (1965, 1974) allows in principle 
for tendencies to use the response categories 
with unequal frequencies. His specification 
of F(S) seems straightforward. The fre- 
quency scale is to be divided into frequency 
categories, one for each response category, 
of size equal to the magnitude of the tend- 
ency to use the corresponding response cate- 
gory. The generalized frequency value of 
any stimulus then corresponds to the value 
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» the Seneralized fre- 
quency value of stimulus SE 


F(S) = E[F(S)]. 


(9) 
According to Parducci (1974, P. 136), the 
range value of S depends only on the two 
end stimuli and On S itself, That implies 
that the range value js independent of how 
the categories are defined, a 
that has also been used by Birnbaum (1974). 
Accordingly, 
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Indeed, the strongest tests are obtained by 
using specially chosen distributions, and 
some striking qualitative results have been 
obtained in this way. Thus, Parducci and 
Perrett (1971; see Anderson, 1974a, Figure 
6) predicted and obtained a crossover of the 
response junctions obtained from a U-shaped 
and a normal distribution. In the same way, 
Birnbaum (1974) predicted and obtained a 
double crossover of the response functions 
from two bimodal distributions. Other the- 
ories of category judgment, such as that of 
Helson (1964), cannot account for these 
crossovers, 

Quantitative tests are also necessary, es- 
pecially if the range-frequency model is to 
be applied to psychophysical scaling. Un- 
fortunately, the quantitative tests have not 
been generally satisfactory. The present 
paper takes a functional measurement ap- 
proach in which the range-frequency model 
itself becomes the base and frame for scaling. 

Two main methods of testing are possible : 
the constant range method, and the range 
extension method. Both rely on comparing 
the response to the same stimulus across 
different distributions. The constant range 
method was first used by Parducci and Per- 
fett (1971). The range extension method 
was introduced by Anderson (1974a; Note 
1, Section ILB.8), who Save a functional 
measurement analysis of both methods, In 
addition, a power series method, noted below, 
has been developed by Birnbaum (1974). 
who has also stressed the func 
surement approach to the 
effects in psychophysics, 

The essential ideas of the constant range 
method and the range extension method will 
be developed more fully here, although no 
attempt will be made to cover all the various 
statistical considerations, Each method will 
be considered first under the equal-frequency 
assumption, and then they will be combined 
to handle the generalized range-frequency 
model. 

The first method compares across stimu- 
lus distributions that have the same end 
stimuli plus one or more additional common 
stimuli, By virtue of Equation 1, each such 
common stimulus has the same range per- 


tional mea- 
study of context 


centile in each distribution. For the ith dis- 
tribution, therefore, 


R(S) = wh (S) + (1 —w)Fi(S). 
(12) 


Thus, the observable Ri(S) is a linear func- 
tion of the known value of Fi(S) for fixed 
S. Standard regression analysis can be used 
to test this linearity prediction with only one 
stimulus common to three distributions. A 
more efficient statistical test can be obtained 
by pooling data across distributions with dif- 
ferent pairwise common stimuli, 

A striking implication of Equation 12 is 
that just one such test can determine the 
complete psychophysical function V(S). 
Both the Tesponse function and the fre- 
quency function are known, and the «v pa- 
rameter is estimated by the regression anal- 
ysis. Accordingly, Equation 12 yields an 
estimate of (S) in terms of known quan- 
tities : 


V (S) = (1/w) (RS) — (1— w)F,)]. 
(13) 


Of course, each distribution should yield the 
same (S), and that provides a further 
check. 

The main idea of this constant range 
method was originally used on a post hoc 
basis by Parducci and Perrett (1971, pp. 
445-447) with the arbitrary assumption that 
w= 1/2. The model did not do well in 
these tests, but no information is given that 
Suggests the nature or cause of the discrep- 
ancies, 

The second, range-extension method com. 
pares across stimulus distributions that have 
different ranges. To illustrate the essential 
idea, consider several distributions that are 
identical over their lower domain, from S, 
up to some medium stimulus Sw. These dis- 
tributions are assumed to differ above Sy 
by having, in particular, different upper end 
stimuli, Sy, for distribution i, 

Any stimulus in the lower domain neces- 
Sarily has the same frequency percentile in 
all distributions because they are identical 
over the lower domain. Furthermore, the 
Tange percentiles of these stimuli are lin- 
early related by virtue of Equation 1, Thus, 
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the range percentiles for distributions 1 and 
i obey the relation, 


VG) = (S), (14) 


is i bjective ranges 
/here c; is the ratio of the sul 
of the two distributions, namely, [V*(Sy) 
— V*(St)\/(V*(Sci) — V*(51)]. 
Under these conditions, the range-fre- 
quency model of Equation 4 becomes 


i - Vi(S) + (1— w)F(S). 
Ri(S) = we Vi(S) ae 
Since F(S) is the same for common stimuli 


in the lower domain, subtraction among dis- 
tributions i, j, and & yields 


R((S) — Rj(S) = (c — ¢;)wVi(S), 
RAS) — R&(S) = (ci — ex)wV4(S). | 
(16) 
Accordingly, the two sets of differences on 
the left should plot as straight-line functions 
of each other, because both are proportional 
to-Vi1(S). Further, if this test is satisfied, 
then the differences provide an estimate of 
the complete psychophysical function Vi(S). 
'This test is not statistically optimal, but 
it does allow a quick inspection of the data. 
A formal and more efficient statistical test 
can be obtained by viewing Equation 15 as 
a compound multiplying-adding model. The 
bilinear test for multiplying models does not 
apply directly, but the model still has a semi- 
linear form that allows direct application of 
analysis of variance( Mandel, 1964, Section 
9.4). In this test, the common stimuli in 
the lower domain constitute one factor of the 
design, and the several distributions consti- 
tute the other factor. This test also holds 
with F(S) replaced by PF), as is sug. 
gested by Equation 16, in which the fre- 

quency value has cancelled out. 
For the generalized range-frequency model, 
an exact test can be obtained by combining 
the two test methods discussed above, To 
illustrate, consider two distributions with 
the same end stimuli. Choose a larger upper 
end stimulus and construct two new distri- 
butions by stretching out the two Original 
distributions in such a way that they remain 
unchanged below some intermediate value 
Sy. Any common stimulus less than Sy 
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will then have the same frequency percentile 
in either distribution and its extension; it 
will also have the same range percentile in 
each pair of distributions with the same end 
stimulus. Thus, the theoretical v 
be expressed in a 2 x 2 array: 


wVi(S) + (1- w)Fi(S) 


; wV¥o(S) + (1 —w)PS) 
wVi(S)- (1 — w) Fo(S) 


wi.) + (1 —w)Pa(S), 
(17) 


generalized 
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two different stimulus distributions. As 
noted above, this result is contrary to Hel- 
son’s (1964) theory of adaptation level but 
agrees with the prediction from range-fre- 
quency theory. TOR 
Birnbaum also gave a quantitative treat- 
ment of the equal-frequency model of Equa- 
tion 4, expressed in somewhat different 
notation. An interesting feature of his anal- 
ysis was that V (S) in Equation 4 was re- 
placed by a power series in the objective 
number metric of S. The power series ap- 
proach has the special advantage that it re- 
quires only the data from a single distribu- 
tion to get a test of fit plus an estimate of the 
;chophysical function. The test of fit, 
psy ever, is not exact. To establish the basic 
or therefore, it would seem preferable 
14 use one of the above exact tests that re- 
is no assumption about the form of the 
uires hvsical function. Once the basic 
lel has been established, then power se- 
mode d other approximate methods can be 
S gend for estimation. 
yep final question remains to be consid- 
One That is the assumption that the overt 
ered. is an interval measure. Within the 
response uency model, response nonlinearity 
d ally distinct from the unequal- 
is concep assumption, and even the general 
frequen uation 8 can fail because of non- 
model 0 A the overt response. In prin- 
linearities nse nonlinearity can be rectified 
ciple, 5 transformation. Thus, Birn- 
by mtm and Gifford (1971) found 
baum, on from the model, which they 
discrepant nonlinear response bias, and 
attribute attempted to remove by mono- 
which they ws transformation. Their anal- 
tone respon ed on the equal-frequency as- 
sis was bas ever, and possibly that as- 


: how 
sumptio™ was at fault rather than the re- 
n was 
sumptio 


ale. x 
igo sential problem of response trans- 
The £ 


.> can, in principle, be handled by 
formation EA measurement in connection 
using ae versions of Array 17. If the 
with large test succeeds, that supports both 
i^ d model and the interval prop- 
di measure. If the paral- 


response dico 
he ; ta owing only to nonlinearity 
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in the response measure, then parallelism 
can be restored by monotone rescaling (An- 
derson, 1974a, Section II.A.6). 

To sum up, functional measurement pro- 
vides an effective analysis of the range-fre- 
quency model under the most general condi- 
tions. Itis not necessary to make the equal- 
frequency assumption, Nothing need be 
assumed about the psychophysical function. 
Neither is it necessary to have a linear or 
equal-interval response scale. All that is at 
issue is the basic algebraic structure of the 
model itself; that constitutes the base and 


frame for the functional measurement anal- 
ysis. 


Notes on Experimental Design 


There are various points of design and 
procedure that should be kept in mind in 
experimental applications of the above re- 
sults. One of the foremost relates to the 
assumption of continuity in the response 
scale that was used in the theoretical anal- 
ysis. In practice, most investigators have 
used a small number of categories, and this 
discreteness may introduce biases that are 
not represented in the model. Although this 
problem may not be overly serious, the use 
of a continuous graphic rating scale has 
much to recommend it. Not only would it 
essentially bypass this discreteness problem, 
but it would also help eliminate sequential 
effects from previous trials on the response 
to the present stimulus (Anderson, 1969, 
1971, 1972b ; Parducci, 1974). 

Since the iterative technique depends 
heavily on the equal-frequency assumption, 
it would seem desirable to facilitate it by in- 
structions. The instructions that have been 
used often seem likely to facilitate unequal- 
frequency tendencies. Of course, such in- 
structions would be appropriate for studying 
the unequal-frequency assumption itself. 

The data analysis is much simpler if the 
rating response is an interval scale, The 
experimental work on information-integra- 
tion theory has used rating scales with good 
success, and this work Suggests that only a 
few modest precautions are necessary (An- 


derson, 1974a, Section TLA 7: 1974d, Sec- 
tion 2.1), 
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ecaution that deserves special men- 
du the use of end anchors, special M 
that are intended to define and tie down iu 
usage of the rating scale. They are wee y 
chosen to be just noticeably more rit 
than the regular experimental stimuli. ' E 
are presented in the preliminary uns o 
help set up the frame of reference an may 
be repeated during the course of the ipu 
ment to keep response usage stable. here 
is ample (if scattered) evidence for a d 
of end effects on both the response and the 
stimulus sides. The stimulus end anchors 
offer the promise of fractionating out oes 
end effects from the pepe Sees and they 
her advantages as wet. 
Ee is some reason to suspect that the 
use of stimulus end anchors, together with 
preliminary practice to set up the stimulus- 
response frame of reference, can markedly 
reduce the distribution effect. That could 
reduce power in the tests of the range-fre- 
quency model. However, the use of stimulus 
end anchors does not seem to violate any 
assumptions of either the iterative technique 
or of range-frequency theory, and it is rec- 
ommended that they be used more widely. 
Finally, it is worth noting that much of 
the above analysis can be adapted for use 
with verbal or symbolic stimuli. For ex- 
ample, the subject might be asked to judge 
likableness of persons described by person- 
ality-trait adjectives OT by photographs. 
Since such stimuli lack a physical metric, it 
would be necessary to obtain an auxiliary 
rank ordering of the stimuli, pee 
a single-subject basis, to primes He ín 
quency percentiles. T hese percenti se 
be subject to error, which. would m o be 
considered in the statistical analysis (see 
e.g., Isaac, 1970). Other than that, hs 
tions such as Equation 12 would app y z 
well to symbolic stimuli as to psychophysical 
stimuli. Equation 13 could then be used to 
obtain V (S). which, though aranca 2s a 
psychophysical function, | could provide a 
convenient method for stimulus scaling. 


CoMMENTS AND CONCLUSIONS 


Iterative Technique 


The present analysis helps clarify the na- 
ture and usefulness of the iterative tech- 
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nique. 


It has two limitations and one ad- 
vantage. 


'The first limitation is the lack of 
protection against failure of its basic equal- 
frequency assumption, that the subject has 
covert tendencies to use the response cate- 
gories equally often. The iterative tech- 
nique is a statistical routine that applies the 
same regardless of whether the tendencies to 
use the response categories are equal or un- 
equal. When these tendencies are unequal, 
however, the iterative technique does not 
neutralize them but instead injects an addi- 
tional bias of its own. 

AW he second limitation of the iterative tech- 
aique is that it provides no basis for inter- 
preting the limit response function even 
tae ater assumption is true. 
AR TAE dais ape on establishing 
T MTS Ar Es such as the range- 
function and indeed i; Pe. 3 ae nar 
nique itself arises os thes es tehi 
stantive model. "RU UE RIS 
oon ie limitations, the iterative 

mique may have unique potential for 

scaling. If the equal-frequency as i 
is true, then the limit res Ve mar 
sponse function will 
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FUNCTIONAL MEASUREMENT DIAGRAM 
eee eines, 
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5, PEL es --->---> r ———————- R 
E "a 
S Sas Ns Š 
Psychophysical Psychological Psychomotor 
Law Law Law 
s =V(S) F= T Suisses) R = M(r) 


FIGURE 2. 


Functional measurement diagram. 


(Physical stimuli S im- 


pinge on the organism and are converted by the valuation operation or 
psychophysical law I” into psychological stimuli s. The psychological stimuli 
are combined according to the integration function or psychological law J 


to yield an implicit response r. 


The implicit response is transformed into 


the overt observable response A according to the output function or psy- 


chomotor law M.) 
sions. In Stevens' view, even the graphic 
rating scale is a "partition" scale and so not 
in general an interval scale. The present 
interpretation requires justification, there- 
fore, and that will be given in two sections 
below. Before doing so, it is necessary to 


clarify what is meant by the term interval 
scale. 


Meaning of Interval Scale in Functional 
Measurement 


As used here, the term interval scale re- 
fers to a linear relation between two quanti- 
ties. Typically, one quantity is observable 
and is used as a measure of the other quan- 
tity, which is unobservable. Accordingly, 
the above statement, “the rating response is 
an interval scale" is ambiguous because only 
one quantity is explicitly specified. There 
are two candidates for the other quantity, 

The customary approach takes this other 
quantity to be the subjective magnitude or 
"sensation," here denoted by V (S). 
distribution effects would in general 
duce a nonlinear relation between |” (S) 
the overt response R(S), the overt re 
would not be an interval scale, 
yestigator like Stevens, who 


Since 
pro- 
and 
sponse 
To an in- 
aimed at de- 


termining the psychophysical function by 
direct estimation, this nonlinearity between 
R(S) and V(S) would naturally be con- 
sidered as an undesirable bias. 

A different conceptualization is embodied 
in functional measurement. The other quan- 
tity in the interval scale relation is not 
V (S), but rather a covert response. This 
conceptualization is made explicit in Figure 
2, which presents a reduced version of the 
functional measurement diagram (Ander- 
son, 1970b, Figure 1). In this diagram, 
physical stimuli, denoted by S, impinge on 
the organism and are processed by the valu- 
ation operation V to become psychological 
stimuli, denoted by s. Thus, s = V (S) de- 
fines the psychophysical function. These 
psychological stimuli are then integrated ac- 
cording to a psychological law 7 to produce 
a covert response 7. The overt response R 
is said to be an interval scale when it is 
linearly related to r. 


A vital feature of Figure 2 is the distinc. 
tion between s and r. The customary ap- 
proach, based on judgments of single stim- 
uli, aims at using R as a direct measure 


of s, ignoring r, or assuming r and s to be 


474 


linearly related. The functional measure- 
ment approach aims at using R as a measure 
fr. 

The necessity of and the justification for 
the functional measurement approach are 
well illustrated by the size-weight illusion 
(Anderson, 1970a). Felt heaviness, which 
is the subjective magnitude that is accessible 
to introspection, results from a preconscious 
integration of information from two stimulus 
Sources, gram weight and visual appearance, 
A measure of felt heaviness will not provide 
the psychophysical function for gram weight 
because the heaviness sensation is also af- 
fected by visual appearance. It might Seem, 
therefore, that the visual appearance consti- 
tutes a biasing contextual effect that should 
be eliminated by experimental means, 

Nevertheless, it is still a legitimate and 
important question whether or not the overt 
response is an interval scale of the conscious 
Sensation of heaviness, In terms of Figure 
2, this question reduces to asking whether 
or not R is a linear function of r. 

This question is decidable. The basis of 
decidability is the psychological law 7. If 
I is an additive function, for example, then 
the data from a factorial stimulus design 
should plot as a set of parallel lines, Op. 
served parallelism thus Provides joint sup- 
port for the additive model and for the inter- 
val property of the response scale, 

Furthermore, this approach can also yield 

the psychophysical law. For if the additive 
model is established, then the marginal 
means of the factorial design estimate the 
s values on validated interval Scales ( Ander- 
son, 1970b, 1974a). In the SIZe-weight ex. 
ample, the row means would constitute an 
interval scale of the subjective value of vis- 
ual appearance—as it functioned within the 
given task. Similarly, the column means 
would constitute an interval scale of the sub. 
jective value of gram weight. The graph of 
these subjective values as a function of the 
corresponding objective, physical values de- 
termines the psychophysical law (Anderson, 
1972a, Figure 3; Weiss, 1975, Figure 2). 


Rating Scales as Interval Scales 


Experimental work on information jnte- 
gration theory has yielded extensive eyj. 


Norman H. ANDERSON 


e , the primary 
ome integration function 
Y combining Stimulus variables in fac- 


there Ing algorithm and 
additive Es ie Clear failures of th 
a » OSE notably ; 
cept: Y !n perso - 
RSN nderson, 19744, edis 2) Ae 
Shanteau (Al work on utility theor b 
tion iw ( 74, 197 3). In erson PR 
tablish Unately, it has been Possible to s 
adding = der raging Model in place of 5s 
stow m pois d i general th vidence 
> S Y te rati 
js e Simple Cognitive ‘igen tasks do 
ating scales are 
À » Of cour. : 
vari Se, 
i OUS biases, and certain expe Subject to 
ee €cessary to p menta] pre 
Scale, 
end of ¢ h atter was di cited i 
distributi Part of this tic] v. 
mark Ait T €Serves eo UTR 
P h 5 €parate 
theoretical] g : i € distribut ect i 
the range-f Actionateg tb 
be und l üc el, d T us pr 
In SÍ bers nm 5, would 
integrat ll the kD Xperiments 
mental i! cory ied Ormation 
effect “utions iminat, Seri. 
Ii q ate d ributig 
Would | bution effects 
Nonpar 1l ly pri duce a m t, they 
roduc EESTE ern of 
norm 4S combinat: ds t 
wis Hike qi "ea have 4 
alue combination agnitude 
treme Pro Fessively f i &rmediar 
H Or 
freque ten strib er ex. 
of all eg, inati o Would shi 1€ equal- 
Stimul 23 Outwa "dgment 
Osite siq the cente 


CONTEXT EFFECTS 


shift iu opposite directions, and that would 
produce systematic discrepancies from par- 
allelism. However, this pattern of discrep- 
ancy is seldom found, which indicates that 
distribution effects can be largely eliminated 
by appropriate experimental precautions. 
Equally, it indicates that the rating method 
can yleld interval scales. 


Magnitude Estimation 


Functional measurement is basically neu- 
tral in the controversy between ratings and 
magnitude estimation (Anderson, 1970b, p. 
166). In functional measurement, the basic 
entity is the psychological law, that is, the 
integration function of Figure 2. That con- 
stitutes a substantive criterion for assessing 
the validity of any proposed response mea- 
sure. Ratings and magnitude estimation 
have equal opportunity to meet this criterion 
of consistency. Ratings have done well 
in numerous tests of algebraic integration 
models (Anderson, 1974a, 1974b, 1974d), 
whereas magnitude estimation has 
poorly. 

The first evidence on this point was ob- 
tained by Stevens himself: “Use various 
standards," advised Stevens, “for it is risky 
to decide the form of a magnitude function 
on the basis of data obtained with only one 
standard" (1956, p. 6). But Stevens! own 
Figure 4 shows that different standards 
yielded markedly different results, Subse- 
quent work has confirmed Stevens' discrep- 
ancy and has shown that many other factors 
also influence the exponent of a power func- 
tion fit to the data (see Jones, 1974; Marks, 
1974; Poulton, 1968). These effects are just 
as much undesirable biases as the distribu- 
tion effects for which Stevens criticized rat- 
ing scales. They bias the exponents, and no 
satisfactory way has been given to decide 


which is the true, unbiased value, if one in- 
deed exists. 


done 


Other consistency criteria may also be ap- 
plied to evaluate the method of magnitude 
estimation. In particular, numerous at- 
tempts have been made to establish algebraic 
models using magnitude estimation as a re- 
sponse measure. Much of this work has 
been summarized elsewhere (e.g., Anderson, 
19742) and will not be repeated here, Be- 
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sides the work of the present author and his 
associates (e.g, Weiss, 1972, 1975), the 
work of the Scandinavians (e.g, Ekman, 
1958; Goude, 1962; Sjóberg, 1966, 1971) 
and the Oregonians (eg. Beck & Shaw, 
1968; Curtis, Attneave, & Harrington, 1968; 
Fagot & Stewart, 1969) deserves special 
mention for their systematic attacks on the 
problem. 


Overall, magnitude estimation has done 
poorly in these tests. That has led Stevens 
(1971, 1974) to argue that the experimental 
task itself imposes a bias. This ad hoc 
argument is not very convincing, however, 
in view of the success of the rating method 
in many of these same tasks. 

A different resolution has been suggested 
by Marks (1974), who has argued that rat- 
ings and magnitude estimation are both 
valid but measure different perceptual quali- 
ties. Ratings are seen as measures of sen- 
sory dissimilarity, whereas magnitude esti- 
mations are seen as measures of sensory in- 
tensity. In Mark's view, then, one set of 
consistency criteria would apply to rating 
scales, another set to magnitude estimation. 

This interpretation, which is akin to that 
of Treisman (1964) and Eisler (1965), is 
reasonable and deserves consideration. 
However, there are at least two grounds for 
skepticism. Since magnitude estimations do 
not generally satisfy the simple models for 
adding and averaging tasks, whereas ratings 
do, Marks’ position makes it necessary to 
argue that these tasks themselves, regard- 
less of the response measure, cause the sub- 
jects to perceive the single stimuli in terms 
of dissimilarity rather than magnitude. This 
view is plausible for judgments of differences, 
but it is hard to see why a set to judge sums 
or averages should induce a reaction of dis- 


similarity, nor is it clear how that might 
operate with more than two stimuli. More- 
over, it would seem that perception of mag- 
nitude is prerequisite to perception of dis- 
similarity in at least some of the judgments 
in question. Having perceived the magni- 
tudes, there is no apparent reason not to 
add or average them directly. 


Of more immediate relevance to Marks’ 
position is whether magnitude estimation 
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will in fact satisfy consistency eriam E 
effects already noted of various contex m 

d procedural factors on the expone 
m eter present a serious inconsistency 
Sor which no resolution Seems to be in 
ight Furthermore, cross-modality Matching, 
which seems to be the main current con- 
sistency criterion for magnitude estimation, 
is not. only ambiguous theoretically (Att- 


neave, 1962; Ekman & Sjoberg, 1965 i 
Treisman; 1964 ; Zinnes, 1969), but also suf- 


fers its own inconsistency by yielding dif- 
ferent exponents depending on which dimen- 
sion is used as the response, _ Stevens and 
Greenbaum (1966) have applied the name 
"regression effect" to this inconsistency but 
that neither resolves it nor even explains 
it (Kappauf, 1975, p. 407). 

It should be emphasized again that the 
controversy between ratings and magnitude 
estimation is not basic from the functional 


measurement point of view. Each response 
measure is a sim 


aic models, 


sponse is a con- 


ern with get. 


ponse Measure js Well taken, 


atter of Convenience. interval 
measures facilitate Progress by an Order of 


Magnitude Compared to rank-order mea- 
sures. More important, an interya] T€sponse 
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t * approach (An. 
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ases the distribution effect is largely if not 
'ntirely a nonperceptual language effect 
(e.g, Anderson, 1974a, Section ILB.8.c; 
|974b, Section III.F.6; Upshaw, 1969). The 

roblem is complicated, however, by the fact 
hat true perceptual contrast does seem to 
ippear under certain conditions (see Ander- 
son, 1971, p. 61). Stimulus integration 
tasks may help clarify this issue. In hetero- 
modal adding (Anderson, 1974c), for exam- 
ple, it may be feasible to distinguish re- 
sponse equalization of judgments of total 
magnitude from truc perceptual changes in- 
duced by varying the distributions of stimuli 
| within the separate dimensions. 


Context Effects: Silk Purses from 
| Sows’ Ears 


Functional measurement provides an im- 
portant perspective on the study of context 
effects. In the traditional approach to psy- 
chophysical measurement, context effects are 
ordinarily treated as undesirable biases 
(Stevens, 1971, p. 448). With functional 
measurement, however, it becomes possible 


to utilize context effects as a tool for mea- 
surement, 


The size-weight experiment is a good ex- 
| ample. The visual appearance does not have 
| a sensory connection to felt heaviness in the 

direct way that the kinesthetic receptors do. 
Its effect upon the judgment is contextual. 
To an investigator oriented toward direct 
measurement of heaviness, the effect of the 
visual appearance would constitute an unde- 
| sirable bias to be eliminated experimentally. 
| In the functional measurement approach, 

however, this context effect can be put to 
good use, It has, of course, primary sub- 


stantive interest of its own. And because it 
obeys a simple algebraic model, it provides 
a way to measure the subjective value of 
gram weight per se. Instead of being an 
undesirable bias, the context effect provides 
a base for validated measurement. This ex- 
ample has special significance because the 
resultant heaviness scale agrees with scales 
obtained from other tasks that are perceptu- 
ally quite different, such as weight averaging 
and cross-modal adding (Anderson, 1970a, 
1972a, 1974c). Such cross-task validation 
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agrees with the emphasis on converging 
operations by Garner, Hake, and Eriksen 
(1956). It also suggests an interesting cog- 
nitive economy in which the same internal 
sensory representation appears in perceptu- 
ally diverse tasks. 

Much the same method and moral hold 
for the contextual distribution effects in 
Parducci's range-frequency model. These 
effects have substantive relevance to a gen- 
eral theory of category judgment. But they 
also can provide a means for getting an in- 
terval scale of the perceptual value V (S) 
(Anderson, 1974a, Birnbaum, 1974; Ander- 
son, Note 1). Here again, a context effect 
is transformed from an undesirable bias into 
a working tool. 

This perspective on context effects is not 
novel, for it can be seen in the work of many 
investigators. The most dramatic example 
comes from the theory of signal detection 
(Green & Swets, 1966; see also Anderson, 
1974a, Section II.B.9). Here the successful 
analysis of the Yes-No task depended on re- 
versing the traditional attitude that bias fac- 
tors should be eliminated, and instead ma- 
nipulating them experimentally in order to 
partial them out. 

An interesting example with semantic 
stimuli is given by the positive context ef- 
fect in person perception (Anderson, 1966, 
1974d, Section 4.3). When one trait adjec- 
tive in a person description is judged, its 
rating shifts toward the values of the other 
traits. There has been some controversy 
whether this positive context effect is to be 
interpreted as a change in the meaning of the 
rated trait, or merely as a semantic illusion 
(Anderson, Note 2). However, since the 
effect obeys a simple integration model, it 
may be used as a basis for scaling regardless 
of its interpretation. 

Other illustrations of this perspective on 
context effects are discussed elsewhere ( An- 
derson, 1974a, 1974d, in press), and a few 
may be cited here. Birnbaum, Kobernick, 
and Veit (1974) report a study of numer- 
osity judgments that is important for its 
account of contextual learning effects. Res- 
tle (1971) and his associates have attempted 
to apply adaptation-level theory to the anal- 
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is of geometrical illusions. The e 
dod A detection theory for scaling las 
s IR above. The work of Price-Wil- 
ns t 1954) and others suggests xd Pu 
actions of time and distance may a M S 
simple algebraic models. Such amr o 
tegration, like that in the size-weig pue 
sion, can be useful as a natural basis. o; 
scaling both conscious and preconscious 
Perd effects may thus be widely useful. 
Systematic manipulation of a context pieri 
can provide a base and frame for validate 
measurement scales. This contextual mea- 
surement approach is not a routine scaling 
procedure, of course, since it depends on ob- 
taining a valid, substantive model for the 
context effect. In this functional perspec- 
tive, measurement theory becomes an or- 
ganic component of substantive theory. 


Psychophysical Law and Psychological Law 


The psychophysical law is the function 
that relates the physical stimulus to its sub- 
jective counterpart. The psychological law 
is the function that relates the subjective 
stimulus values to the covert response (see 
Figure 2), An investigator may choose to 
focus on one or the other of these laws. 
This choice can have far-reaching conse- 
quences, in t perimental task, of 


erms of ex 
design and data analysis, and of theoretical 


direction. 
"This distinction is well 
difference that may seem 
but is fundamental, The 
law, as ordinarily conceived, i 
one variable. This one- 


' 


illustrated in a 
mere] 


ties 
It leads 
Y of judgments 
edures intended 
imulus Situation 
can be ignored, 
in Contrast, is or. 


in which contextual factors 

The psychological law, 
dinarily a function of two or more variables. 
"This several-variable Tepresentation is a nat. 
ural consequence of a concern with 


Stimulus 
integration. Indeed, the PSycholo. 


gical law 
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is the integration function for a multiple 
sti field. A VESNI 
ES ae E variable me 
tion makes a decisive difference for p A 
ment theory. A basic problem for vasis 
theory of measurement is to provide a n 
for transforming an ordinal scale into eos, 
terval scale. A function of only one Mo. 
is inadequate because it provides no iue 
straints on such monotone transformatio ; 
But with two variables in a factorial peli 
it is possible to allow for monotone E ed 
formation and still retain degrees of M 
dom for testing the psychological law ( i 
derson, 1962, 1970b). Thus, the p 
logical law is fundamental, for its substantiv E 
interest and as the basis for Evae) 
From this standpoint, it can be argue 

that the one-variable pursuit of the pm 
physical law has had ill effects, Not dd 
is the one-variable approach too narrow t 

Support a theory of measurement itself, but 
it has misdirected inquiry away from the 
more fundamental psychological law. This 
one-variable-many-variable distinction was 
EDEN Qr a 


sts of algebraic models even with 
rank-order data, The first method can be illu- 
Strated for an adding model tested in a two-factor 
design. Available computer programs develgpi® 
in multidimensional scaling (e.g, Kruskal, 1965; 
applied to make the date as 
additive as Possible separately for each replica- 
€ deviations from additivity are 
all intercorrelateq Within each replication, pe. 
independent across Tepe ae 
ordinary measuremen 


z YDOthesis of zero interaction. An 
invalid mode] w 

pattern of nonadditivit: 
SO yield a signifi 
has a bias to re 
bias would appea; 


her 
the response, the oth 

In the geo MS 
» for example, the respons 
Would be transformed to make (A + B) as addi- 
tive as Possible, The 


à onse should 
then be an interva] sca pes pec] directly 
in the analysis of variance to test the model-implied 
null hypothesis of no two-way interaction between 

and the compound variable (A + B). 
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the basic issue in the above discussion of 
context effects. Other aspects of this same 
issue arise in the four following statements, 
which represent dominant themes or ideas 
in current psychophysics. 

Sensation is conscious. This idea is im- 
plicit in the various methods of direct esti- 
mation, since they ask for introspective re- 
ports. However, consciousness is a window 
on only one stage in the stimulus-response 
chain. In the size-weight illusion, as noted 
above, the sensory effect of the physical 
weight is preconscious, not accessible to in- 
trospection. The same holds for many of 
the physical cues that govern judgments of 
size and distance or of taste. 

These examples are typical of many, per- 
haps most perceptual tasks. They involve 
stimulus integration, often across different 
sense modalities at a preconscious stage. 
Measurement at the conscious stage will pro- 
vide a direct linear scale of preconscious 
sensory effects only under special conditions. 
In general, such scaling will require meth- 
ods than can penetrate behind the conscious 
sensation to dissect it into its preconscious 
components, 

The link between the physical stimulus 
and the conscious sensation can be treated 
as a single sensory transducer. This idea 
was explicit in Stevens' (1960) attempts to 
use the exponent of a power function fit as 
an invariant characteristic of the sensory 
transducer, and it perseveres in his later 
writings (e.g., Stevens, 1974, p. 370). More 
generally, it seems implicit in the quest for 
a single rational form for the psychophysical 
law. 

This idea is not reasonable from either a 
physiological or a psychological view. Physi- 
ologically, many successive stages lie between 
the physical stimulus and the conscious sen- 
sation. To treat their combined action as 
a lumped unit may have practical utility, 
but such a lump seems unlikely to have 
much physiological meaning. —Psychologi- 
cally, conscious sensations are in general re- 
sultants integrated across different sense mo- 
dalities, as already noted. Being sensory 
compounds, they cannot in general be treated 
as single sensory representatives. 


479 


The psychophysical law has a single, sim- 
ple mathematical form. Perhaps no idea has 
been more attractive in the history of psy- 
chophysics. Fechner’s method of cumulat- 
ing just noticeable differences would prob- 
ably have attracted minor interest except 
that Weber's empirical law led to a theo- 
retical logarithmic function. Later workers 
have often argued that some other simple 
law should apply, but few have been willing 
to abandon the hope for simplicity at this 
level. This hope seems to have rested 
largely on the feeling that the psychophysical 
law was somehow fundamental, and by na- 
ture deserved to have a simple form. In 
part also, the one-variable approach had lit- 
tle alternative but to hope for some simple 
form to appear. 

Yet this idea is not credible from either a 
physiological or a psychological standpoint. 
The reasoning here is much the same as for 
the previous idea. Furthermore, the psy- 
chophysical function must be expected to dif- 
fer between senses and across organisms. 
Log and power functions may be quite use- 
ful as rough numerical approximations, but 
it is unlikely that they will be exact or gen- 
eral. Much of their popularity seems to 
have resulted from lack of any pragmatic 
alternative. The many-variable approach 
provides such an alternative, by not attempt- 
ing to impose another arbitrary assumption 
and instead providing a method to detect 
how nature actually operates. 

The psychophysical law has fundamental 
importance. The sources of this idea are 
not entirely clear. In part, it seems to stem 
from Fechner’s original concern with the 
mind-body problem, in part also from the 
idea, noted above, that the psychophysical 
law represents a single sensory transducer. 
And in part, perhaps, it was felt that dis- 
covery of the psychophysical law would solve 
the problem of measurement, Unfortu- 
nately, this orientation toward the psycho- 
physical law has led to methodology that is 
too narrow to resolve its own problems (An- 
derson, 1970b). 

In the functional measurement approach, 
it is the integration function or psychological 
law that is basic. It provides the foundation 
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conceptually and 
tes. Conceptually, the measurement 
technically. their existence and definition 
scales attain hé integration: functor Tech 
by AW Du d function provides the 
STN scaling the response and also for 
E. 5 stimuli, Measurement is thus 
E cional with the psychological gef ls 
In this approach, the psychophysica an 
is E E from the R a By 
sychophysical law is no longer funda 
m E theretore. and it loses much of the 
HMM nd significance that have been at- 
dm " it What interest remains may 
es ; reside in viewing the subjective stim- 
M ues not as primitive sensations but 
d as the next object of analysis, them- 
selves perhaps the product of an earlier in- 
tegration stage. ! 
The essential themes of functional mea- 
surement can be seen in much previous 
work. The study of stimulus integration is 
time-honored in perception, and many ear- 
lier investigators have developed various as- 
pects of the functional measu ) 
proach. Even a partial catalog of algebraic 
models for perception (Anderson, 1974a) 
shows how numerous such attempts have 
been. Of historical interest are Thurstone's 
(1927) theory of comparative judgment 
(see Anderson, 1970a, p. 163; 1973) and 
the work on loudness summation by Fletcher 
and Munson (1933) and on loudness frac- 
tionation by 


Geiger and Firestone (1933). 


Among Present-day writers, special men- 
tion should be 


made of the work 
Birnbaum (1974) i j 


rement ap- 


3 er 
, and Erik: 


ton (1968), Ross and DiLollo ( 
berg (1966), Sternberg (1969) , Treisman 
(1964), and Weiss (19 whose views 
overlap in various respects with the present 
discussion. 

Functional measurement does provide a 
unified treatment of much Previous work. 
Beyond that, it represents 4 Significant ad- 
vance in matters of techni 


que, and in em- 
pirical accomplishment, 


Conceptually, how- 
ever, it may best be considered asar 


eunit- 
ing of measurement theory with the common 
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iricis 1 this way, 
sense of common empiricism. l 


its organic role 
measurement theory resumes its a vx 
in substantive theory with mutua pe 
ening. It is to be hoped that the Cu 
psychophysics will in similar way rediss 
into the ocean of perception. 
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THEORETICAL NOTE 


Parallel Psychometric Functions from a Set of Independent Detectors 


David M. Green 
Laboratory of Psychophysics, 
Harvard University 


Consider combining several elementary 


rule of responding "no" only when all el 


“yes” otherwise. The question raised is: 
detectors have the property that the resulting psychomet: 
the original function displaced in the logarithm of the 


answer is p(7) = 1 — eel? a 8 > 0, 


Detection data are often assumed to arise 
by combining the decisions of a number of 
more elementary detectors. One extreme as- 
sumption is that the elementary detectors are 
binary and the final decision is affirmative if 
and only if at least one detector reports a 
detection. Obviously a number of other 
decision rules are possible—one might, for 
example, require some fraction of the ele- 
mentary detectors to report detection. Such 
rules are more flexible and may prove more 
prudent or realistic in particular detection 
situations. Smith and Wilson (1953) have 
analyzed such systems in some detail. The 
appeal of the extreme rule is its simplicity, and 
so it is not surprising that discussions of this 
extreme rule occur in many different areas as 
a kind of null model. [n certain classical 
psychophysical experiments, for example, the 
influence of signal duration or area on detec- 
tion can be treated in this way, as can complex 


signals composed of one or more separate 
parts. 


The psychometric fun 
such experiments exhibit a Surprisingly 
relation. Consider, for example, two different 
signal durations. Presumably the shorter 
duration involves fewer elementary detectors 
than does the larger one, but when one mea- 


ctions observed in 


simple 
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detectors by the extreme decision 
ementary detectors respond “no” and 
Which psychometric functions for the 
ric function is simply 
physical scale? The 


sures the psychometric function at each dura- 
tion as a function of log intensity, one usually 
finds that both psychometric functions have 
essentially the same shape. That is to say, 
these psychometric functions are nearly 
parallel, the one simply being the other dis- 
placed to the left or right by an amount that 
is a function of the parameter value being 
varied—in this example, signal duration. 

Such results were interpreted. by Green, 
McKey, and Licklider (1959) as unfavorable 
to the putative model for the following reason. 
Let p denote the elementary detection prob- 
ability, and let 5, denote that of a system 
involving z independent elementary detectors. 


Assuming the decision rule stated, it is obvious 
that 


fac — Ba: 


Since the relation between Pn 
linear, it was argued that the shape of the 
psychometric function must change as n 
varies. Certainly this is true for a variety of 
simple psychometric functions, and if » is 
large enough, this change should be easily 
noticed, even in empirical data. 

However, as Quick (in Press) has pointed 
out, the function 


Bü) = 1 et, 


(1) 


and p is non- 


a0, 
has the property that Pn in 
the same form provided I i 
displaced by a function of n. Thus, on a 


logarithmic scale of intensity, »() and p, (2) 
are parallel. The argument is simple: 


Pn 


B»0 (2) 


Equation 1 is of 


f= ith — Bp 
d s (e7et8)n 
t= eTa QWBI)8. 


li 


So the function given in Equation 2 has the 
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en observed empirically. 
es an DN af chis note is to proye a 
n: a ie function with the property t at 
S ^i dens @ on the positive integers 
Become Sane probability function p, 
and so 


bn(T) = pLa (n)I]. (3) 


iving this result and commenting 
Op e rales briefly on two points. The 
a ic he issue of false alarms. If we denote 
CE h MEUS SD rate of an elementary 
ds ae then by Equation 1, the aon 
TR : ile, og OR which, 
AE eee approximately ne. It is 
m h tif the system has a large number of 
nea 3 its false-alarm rate will be modest 
ER de false-alarm value of each detector 
durae low. This is one reason that we 
NE the model as an extreme one. 
The second point to observe is tha 
2 has appeared in the detection literature at 
fede as early as Brindley (1963). He assumed 
that J is proportional to the number of quanta 
impinging on an elementary detector, that 
here are independent detectors, and that m 
cde are required to excite any one of them, 
He claimed to show—though we believe the 
argument wrong (see Appendix)—that for 
large n the process approaches that of Equation 
2 with 8 = m anda = n/m! 
The same mathemati 


at Equation 


the 


distribution, Th €y paper is 


e 
Tippett (1928); it is 


cerning the conditi 
tion to the c 
unique, 


Ott, Who has been Working on a 
related problem jn Choice theory (Note 1) 
brought this literature to i 


Our attention and 
showed us how to use Feller’ 


US results, 
m ce 


follows. first proved 
restriction of the function a(n) 


gested that a somewhat weaker 


Othesis we 
do, and we proved the results under hj ould 
Later, we read the draft oj n artic] 


(Note 1), in which he proved 4 related Y Yellott 
out assuming anything about the fi 
but assuming that p is strictly increasin 

tinuous. This brought to Our attention th pe 
connection of our problem to that of the statis 
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Form of the Psychometric Function 


metric 
THEOREM. Assume that the porto s 
function p is a ERU E fau 
tion on the nonnegative reals (i.e., it is in 


z A ld, 
and onto [0, 1]) and that Equations 1 and 3 hold 
then 


y 


LSS s (2) 
Proof. From Equations 1 and 2 ] 
C1 — 8)" = 1 — pia: 
Let 
: s(x) = 1 — p(1/x), (4) 


then it is easy to see that g is a Popp 
function, that g” is the distribution Sm (a 
largest of n independent random varia A a 
each with the distribution function g, and tha 


gx)" = [1 — b(1/x)] 
i bs (1/x) 

1 — pla(n)/x] 
g[x/a (1)]. 

So if X,isa random y. 
£", then X, /a (n) 


X 


P S) < x] 


(Equation 4) 
(Equation 1) | 
(Equation 3) 
(Equation 4) 


. . : ; ion 
ariable with distributio 
has the distribution 


Ska S a(n)x] 


= gla(n)x]" 

= g(x). 
(1966, 
n/a(n) te 


According to Feller 
distributions of X 
£ Not concentrate, 


Pp. 270-271), if E 
nd to a distributio 


Y at Ü—in this case, they a 
have the same distribution £—then 
g(x) = SUM. wx & 0. 
Thus, E FE 
PD) -1— gay } 

Exe E e «/I)n 

EX — arp B=—p>0 
Abblication 

If we Perform an ex 


H H s two 
periment involving 
Values of n 


i wO 
wo h Say ni and n2, and find tye rd : 
1 tensities, Ti and I» that produce the sa 
Probability of detectio 


n, it follows from ) 


Pn = 1 — ema(nvaryp 


5 with 
Yaet emes, In discussing these matters Der. 
Dt, the question was raised whether any 
sumptions à 
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that 
ni? = nI, 
and so 
log 21/719 


B log Z2/Ii 


For a variety of cases in vision and audition 
we know that doubling the duration of the 
stimulus—presumably rendering n/n = 2— 
is compensated by a stimulus change of from 
1.5 to 3 db (that is, 72/71 from 2! to 2). Thus, 
we estimate 8 in the range from 1 to 2. 

Note that it is important in estimating 8 to 
measure intensity or energy rather than some 
quantity, such as pressure, which is propor- 
tional to J}. In such cases the estimated 
value of the exponent will appear to be twice 
as large since, if 3*« 7 then 72 œ x28, 


Two Devices Displaying T his Property 


Suppose a Poisson device produces counts 
with an intensity parameter u =al®, the 
device is monitored for an interval of time A, 
and it reports a detection if at least one count 
occurs within the interval. The probability 
of no counts is e743, so 


P (detection) = 1 — e» 
Em]. ge. 


Such a device and the given decision rule 
yield the required psychometric function. 

The application of such a dete 
interesting. First, it seems likel 
device is plausible only for brief signals, since 
it assumes that a single count is taken as 
evidence for the occurrence of a Signal. Second, 
if one envisions this process as applying when 
signal duration is varied, then A is probably not 
varied because it would eventually generate 
high false-alarm rates. Rather the outputs of 
a set of independent monitors are combined, 
and the time-intensity trade is generated. 
The exact form of that trade depends on B, 
but it is possible to predict trades that 
equal-energy contours. 

As Wandell has pointed out to us, the proof 
of the theorem Suggests a second decision 
mechanism yielding Equation 2 as the psycho- 
metric function. Suppose the system notes 
which of several channels appears to be active, 
calculates a statistic for each, and compares 
the smallest value with a criterion (i.e., selects 
the largest among the reciprocals of the 
statistics). If n channels are active and Y, 
is the smallest statistic, and if there are con- 
stants a (2) such that a ()Y, has a distribution 


ction model is 
/ that such a 


are not 
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that tends to p, then p is of the form of 
Equation 2. 


Comparison with Other Psychometric 
Functions 


On the theoretical side, we have shown that 
Equation 2 is the only psychometric function 
having the property that if one of n detectors 
with that psychometric function triggers a 
response, the resulting psychometric function 
is exactly parallel (on a logarithmic axis) to 
the elementary one. Is this property of any 
empirical value? Is it possible to determine 
whether empirical data obey this rule? The 
answers to such questions depend on what 
alternative psychometric functions are con- 
sidered and the number of elementary detectors 
that one presumes are combined. It is im- 
possible to give a general answer to the ques- 
tion, but the following relations provide some 
insight into the nature of the problem by 
showing how to compute the expected devia- 
tion for whatever alternative psychometric 
function one wishes to consider. 

Let p(Z) be the psychometric function given 
in Equation 2. Let some other psychometric 
function be given by p(Z), and denote the 
difference between them by y(I)=p(I)—p(1). 
The following will be simplified if we omit the 
argument J and denote p(I) = p, p(I) = p, 
and y(I) = y. Consider 


pe = 1— OC gis = f — 1 — a 
=1— 0 = pad = pany 
— 1 2 
= n D ac ge yy 
- ** Binominal expansion, 
psi d d a 


Thus, 
[bn — pal < «(1 — g)1ly| 
n(n — 1) 4. 
det (1.— p)* tA... 
n? ^ n 
Saly 5 lvl jrit 
=mi — 1. 


Although the bound may be 
permit an estimate of the error. If one de- 
sires the error for the 7 detector case to satisfy 
0 < e«& 1, then it is sufficient that i 


generous, it does 


enl — 4 € e, 
that is, 


uly] € in(14- 9 


"p 
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and so 


z (3) 
le— P| = Il S 
is eee in applying this result is that 
Š ft does not know z; in fact, it is usually 
rosa ei from the data. If a situation can 
D sranged where z is known, such as n 
e TE of a signal or » dots in a visual field, 
Ld the requirement given by Equation 5 
can be tested. 


Summary 


d xtreme decision rule, namely 

Giyen RERE when at least one 
resp! tary detector reports a detection, we 
Pome an condition stated in Equation 3 
: ields Equation 2 as the only possible psycho- 
aman function. Some rough bounds on the 
parameters are given. Two mechanisms that 
produce such a Psychometric function are 
presented, and the relation between this 


function and some other psychometric func- 
tions is derived. 
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APPENDIX 


THE FAILURE OF 


Brindley (1963) claimed to show 


b» —1— (D E I4 n/a 
deed I — 1 


approaches, presumably uniformly, 


that 


Je)» 


ib Ce nia 
as n becomes large. This is true for m = 
since both equations ri 
is also true for 


measurable 5, 


- mg only first-order 
terms in Z, we need onl ns 
within the brackets of t 


he two equations, for 
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